THE HEALTH AND ECONOMIC EFFECTS OF DRINKING WATER
         David McCormick and Cristine Candela
                Bendix Corporation
              Bendbr
Applied Science
& Technology
Division

-------
THE HEALTH AND ECONOMIC EFFECTS OF

             DRINKING WATER
                     By


              David McCormick

               Cristine Candela
          Contract No. 68-01-1867
          Project No.  WA73-A368
               Project Officer

              Dennis Tihansky
 Washington Environmental Research Center
             Washington,  D.  C.
               Prepared for

     Office of Research and Development
    U.S. Environmental Protection Agency
           Washington,  D.  C. 20460

-------
                  EPA Review Notice
This report has been reviewed by the Environmental
Protection Agency and approved for publication.
Approval does not signify that the contents neces-
sarily reflect the views and policies of the Envi-
ronmental  Protection Agency,  nor does mention of
trade names or  commercial products constitute
endorsement or recommendation of use.

-------
                             ABSTRACT
       This study examines the relationship between sub-clinical common
illness such as upset stomach and other low level diseases,  and quality of
drinking water and water treatment plant characteristics. Such illnesses
account for the greatest occurrence of health problems in the country.
For example, in 1969 the National Health Survey found that between 67
and 72 percent of disability days arising from illness fell into the sub-
clinical category.

       The basic hypothesis tested was that if one accounts for all macro -
systemic influences on health, the occurrence of sub-clinical illnesses
will be highest in those communities with the worst water quality.  The
measures of drinking water quality and water treatment plant character-
istics were obtained from a Public Health Survey study entitled, "Commu-
nity Water Supply Study".   Elementary school absenteeism was used as a
surrogate measure for sub-clinical disease.  The primary statistical tool
employed was stepwise multiple regression.

       A linear regression was performed of elementary  school
absenteeism onto total plate counts,  measures of water treatment facilities,
and a series  of control variables. The resulting regression was essentially
a damage function from which percentage absences resulting from incre-
mental changes in water quality were predicted.   Under the assumption
that low  level illness in the adult population is affected by a constant
proportion to childrens1 illness,  the number of drinking water-related
absences and their associated dollar value were  estimated.
                                  111

-------
                           CONTENTS
Abstract

Sections
I        Conclusions                                              1

II        Recommendations                                         2

III       Problem and Approach                                    3
              Background                                         3
              Community Water Supply Study, 1970                 3
              This Study                                          4

IV       Literature Review                                        7
              Human Health Effects from Water Pollution          7
              Waterborne Disease Outbreaks                       8
              Viruses in Water Supply                            12
              Health and Mineral Characteristics of Water
                 Supplies                                         16
              Coliform  and Pathogenic Organisms                 17
              Bathing and Recreational Water                     18
              Summertime Gastrointestinal Illness                19
              Benefits from Reduction of Waterborne
                 Disease Outbreaks                              20
              Summary                                          22

V        Research Design                                        25
              The Problem                                      25
              Data                                              26
                    Absenteeism Data                            26
                    Water Quality Data                           30
                    Air Quality Data                              32
                    Socio-Economic Data                         33
                    Climate                                      34
              Hypotheses                                        35
              Analytical Methodology                             40
              Missing Data                                      41
              Case Selection and Aggregation                     43
              The Test                                          45
                    A General Model                             45
                    Regional Analyses                            47
                    Six Area Grouping - Rural                    50
                    Six Area Grouping - Urban/Suburban          51
              Effects                                            54
              Conclusions                                        64

VI       Bibliography                                            66

Appendices                                                      71
         A.  California Results
         B.  The Profile of Medical Practice

                                iv

-------
                             SECTION I

                           CONCLUSIONS
       A consistent relationship was found between operator salary and
elementary school absenteeism,  controlling for community wealth and
urbanization.  The frequency of testing finished water, times the salary
of the operator(s), results in a useful index of the operation of a water
treatment facility.

       Regression analyses allowed us to predict the percentage of
elementary school absences which result from incremental changes in
water quality.  The results broadly  indicated that a . 2 to . 5 percent
increase in school attendance can be expected for each $1000 increase
in operator salary.  Nationwide and extrapolating to the adult population,
improvements of this magnitude would produce savings (in wages and
medical expenses) of approximately $250 million per year.

       Conclusively, our findings show that water treatment plant
operators'  salaries appear to be a useful predictor of elementary
school absenteeism.  However, one cannot assume that pay raises
will immediately improve  community health.  The very low salaries
which are now prevalent have resulted in the recruitment of poorly
educated and undertrained operators. While, in time, higher salaries
will make possible the selection  of higher caliber operators, in the
short term, the salaries would have to be supplemented by vigorous
operator improvement programs if any significant result is to be
expected.

       Therefore, we computed  benefits  on the basis of increments in
improvements in the various measures of water quality.  The various
increments predict reductions of 3% to 10% of the digestive and  non-
respiratory illnesses.  In  the specific areas  studied, this translated to
reductions in elementary school  absences on the order of 300, 000 to
600,000 absences per year per incremental improvements.   Converting
this to worker absenteeism using a constant of proportionality of . 62,
this becomes a savings in  the order of 23 million dollars per year per
increment.  For the nationwide sample, we would cautiously estimate
that elementary school absences would be reduced on the order  of
magnitude  of 11-12 million absences per  year per  incremental im-
provement. The wages and medical expenses saved would range in
the order of 250 million dollars per  year  per incremental improvement.

-------
                              SECTION II

                          REG OMMENDATIONS
       In carrying out the study,  every attempt was made to be conser-
vative in our estimates.  The medical cost figures,  for example,  assume
that there are no respiratory diseases resulting from drinking water.  If
they were transmitted in a manner similar to gastrointestinal illness,
the medical cost figures would be more than twice what we have estimated.
At all other points  in the research program, when we were faced with two
alternatives, one of which would tend to inflate the results and the other
of which would tend to deflate them, we systematically chose the more
conservative.

       We believe  that the results indicate that if improvements are made
in the quality of drinking water,  substantial savings  will be realized.  As
the CWSS report suggested and our study has confirmed,  the problems are
most  severe in the rural non-farm communities.  There the economic
savings alone would be most dramatic both because  the effects are rela-
tively large, and the costs of resolving  the problem are relatively small.
Inmost cases, upgrading the personnel and procedures appears to be all
that is needed.

-------
                             SECTION -IK

                        PROBLEM AND APPROACH

Background

      Conventional rehtoric used to be that U.S. drinking water was of the
highest quality and purity.  In fact,  most Americans took-water purity for
granted, considering it a fact of life.  Recent studies,  however, have served
to dispel such inaccuracies. Besides the alarming discoveries of carcino-
genic agents in drinking water, there have been analyses such as Craun and
McCabe's findings that I) water-related epidemics affecting an average of 100
persons per outbreak occur at an average rate of one per month; " 2)
McDermott's estimates that of annual occurrences of 500, 000 infections of
hepatitis and two million of aseptic meningitis,  both can be transmitted by
water^»; and 3) Mosley's 1967 survey of waterborne disease viruses, which
opened a wide range of possible links to major health problems. )

Community Water Supply Study, 1970

      In July 1970, the Department of Health, Education, and Welfare
published results of their own nation-wide evaluation of drinking water
quality and water treatment plants.   This Community Water Supply Study
(CWSS) shows that the quality of drinking water is far below general ex-
pectations.4'    Significant results were:

         Forty-one percent of the water systems,  delivered inferior
         water to  14% of the study population.

         More than one-third of the tap water samples  contained con-
         taminants in excess of Public Health Service standards.

         Over half of the  systems had physical or operational de-
         ficiencies.
1)   G. Craun,  L. McCabe, "Waterborne Disease Outbreaks,  1961-1970".
     Annual Meeting of the American Water Works Association,  1971.

2)   J. H. McDermott,  "Virus and Water Quality:  Occurrence and Control -
    . Conference Summary"  (13th Water Quality Conference, February 1971),
     p. 213.

3)    Mosley, James, "Transmission of Viral Diseases by Drinking Water"
      from Bery (eds.)  Transmission of Viral Diseases by the Water Route,
      New York, 1965.

4)    McCabe, Leland J. , J. Symons, R. Lee,  and G.  G. Robeck,  "Survey
      of Community Water Supply System",  Journal of the American Water -
      works Association, Vol.  62,  No. 11,  November 1970, p. 670-687.

-------
      •   More than three-fourths of the plant operators had inadequate
         training in fundamental water microbiology; nearly half in
         application chemistry.

      •   Control,  inspection, surveillance, and testing programs were
         entirely lacking,  infrequently performed,  or well below stand-
         ards in the vast majority of systems.

      Surprising as they may be, the CWSS results are probably optimistic,
since water quality measurements and standards suffer from a lack of virus
data. Direct methods for  measuring viruses in large quantities of water are
under development but not yet available.  Although present water purification
technology can handle all significant bacterial problems,  in practice, availa-
ble technology is not always applied.  And it is far from certain that viruses
and harmful  chemicals can be easily removed with existing technology.

      In recent years, the American press and popular literature have re-
flected a growing public awareness of environmental pollution. At the same
time, public  attention has  focused on the cost in higher taxes and consumer
prices to clean it up.  Confronted with this price tag, taxpayers and the
decision-makers who represent them naturally ask what they're buying.

      In the area of water  pollution control, there are numerous payoffs, including
direct economic benefits such as commercial fishing, measured real estate
values,  increased recreational use, and generally improved aesthetic value,
all of which contribute to improving quality of life for Americans.

      Among the costs incurred by our  degraded water supplies are those of
purifying it to make it safe to drink.  Alternatively,  of course, failure to purify
the water sufficiently can  be argued to increase the possibility of illness.

      However, the question arises whether and to what extent incremental
degradation of the drinking water has negative effects on community health.
If  such effects exist, there are associated, definitive economic implications
both on the cost of operating the treatment facilities and on the costs  of im-
proving the quality of the intake water quality.

This Study

      Numerous studies have already been undertaken to  examine the effects'
of water degradation on serious communicable disease.  In view of this fact,
this study proposed an examination of data relating to sub-clinical illnesses
such as gastroenteritis and other non-specific low level diseases.  Such
illnesses account for high occurrence of health problems in the United States.
In 1969, for  example, the  National Health Survey found that between 67 and
72 percent of disability days  arising from illness fell into the  sub-clinical
illness  category.

      Based on the assumption that the economic costs of sub-clinical ill-
nesses affect a far wider distribution of people than do major  illnesses,
lost productivity alone due to sub-clinical absences  in 1969 totaled in the

-------
general range of $4 1/2 billion dollars.*  Consequently, the costs of sub-
clinical illness are large and,  in fact,  are more than twice the total EPA
1974 estimated budget outlays.

      This study was, therefore, undertaken with the basic hypothesis
that any significant health effects from water degradation would result pri-
marily in sub-clinical illnesses.  To perform the study, some reliable
measure was needed for community health.  Worker sick time data, the
most direct measure are not usually reported for a community.  Even where
they are available, sick time data are notoriously unreliable and can tend to
follow a form of Parkinson's Law:  sick days off expand to absorb sick days
allowable.  Therefore, a widely reported indicator of community health had
to be chosen.  We selected absenteeism from  elementary schools as our
indicator  for the following reasons:

      •  Records in most school districts are reasonably accurate
        and complete.

        Elementary school children represent a wide spectrum of
        socio-economic classes in a community.

        Truancy is less serious problem for  early grades, than for
        high school.

      •  Both children and adults are susceptible to low level illnesses;
        therefore,  school absenteeism can be linked to work absenteeism.

      •  Biases in absenteeism due to non-water related phenomena
        can be  partly explained by measurable variations in socio-
        economic community  characteristics.

      For the communities studied, we performed a linear regression of
elementary school absenteeism onto total plate counts, measures of the
treatment facilities, and a series of control variables.

      The resulting regression is essentially a damage function from which
we were able to predict the percentage absences resulting from incremental
changes in water quality.  Under the assumption that low level illness in the
adult population is affected by a constant proportion to childrens1 illness,
the number of drinking water related absences and their associated dollar
value were  estimated.  This was done for the  nine areas used in the CWSS
study: the State of Vermont, and eight standard metropolitan statistical
areas (New York, N.  Y. ;  Charleston, W. Va. ; Charleston, S. C. ;
Cincinnatti, Ohio; Kansas City, Missouri-Kansas; New Orleans, Louisiana;
Pueblo, Colorado;  and San Bernadino-Riverside-Ontario,  California.  In
   Based on 368 million disability days to persons 14 and over due to acute
   illnesses, 67% of which are sub-clinical, a median,  daily wage of $21.00,
   and 58% employment.

-------
addition, extremely general estimates were made for the total nation.

      The ensuing sections of this report will review:

         1.)  the findings from a literature review covering re-
             search on water quality relationships to low level
             illness;

         2.)  the research design and methodology, and

         3.)  description of analysis and findings.

         Five appendices list data resources. A bibliography
         provides  references relevant to the  research.

-------
                            SECTION IV

                       LITERATURE REVIEW

Human Health Effects from Water Pollution

Method - To ascertain what is already known about the costs of water
pollution with relation to low level illness, a survey of the relevant
literature was conducted.  Although several articles are discussed
and quoted with  regard to water quality and illness, the major studies
reported were reviewed critically for the utility of their objectives,
fulfillment of these objectives, and the analytical methods used.
Sources are mentioned wherever  given.  Emphasis is  given to con-
sistent findings  between water quality and illness, with attention
to geographical  aspects involved.

Related Research

There is still a  paucity of research in the area of water and human
health,  particularly with respect to low level illnesses. Also, there
is a dearth of records and statistics  dealing with minor illnesses.
In this literature review, the goal is to complement our research
objective to measure the economic costs to society in lost wages  and
medical expenses arising from low level illness associated with water
quality.  The inadequacy of background data is significant. Krishnaswami
has pointed out, "that man's direct and indirect contact with water and
the aquatic..environment varies considerably,  and that the total impact of all
environmental factors  - air, water and food  - cannot be compart-
mentalized in assessing their  significance to health and disease".5)
In a  quote from Dubos, it is stated  that:  "Experimental and
epidemiological studies (e.g., common cold and infectious hepatitis,
respectively), have shown that even  in the transmission of pathogenic
organisms and the production  of overt disease ... a simple  relation-
ship of causal specificity does not always exist.   The  presence of a
pathogen or a toxicant is only  a necessary condition, but not a
sufficient condition for the causation of a disease. " °'
5)   S. Krishnaswami,  "Health Aspects of Water Quality" American
     Journal of Public Health, 61:2260 (November  1971).  For
     a literature search of published and unpublished estimates of
     the cost of air pollution damage, see Larry B.  Barrett and
     Thomas C.  Waddell, Cost  of Air Pollution Damage:  A Status
     Report, EPA, February 1973.   Also see literature review
     contained in Lester B. Lave and Eugene P. Seskin  "Air
     Pollution and Human Health", Science 169, 21 August 1970,
     723-733.

6)   Dubos, Rene1.  Man,  Medicine and Environment.  New York:
     The New American Library, Mentor MY 942, 160 pp. 1969.

-------
Krishnaswami adds that, "The overt clinical manifestation of severe
chemical deficiency or of acute toxicity can be evaluated and their
causes discussed relatively easily.  However,  the role of water con-
stituents (relatively significant concentrations of various synthetic
organics, heavy metals, and other chemical toxicants) in the probable
multifactorial causation or contribution to chronic ailments and other
bodily stresses  have not received serious consideration. "?)

Thus,  in conducting studies in the area of water quality and minor
illness one must deal with the fact that their occurrence is negligibly
reported.  Many individuals will forego any physician consultation
during low level illnesses,  but additionally, records  that do exist
are incomplete as to causation, circumstances, length of illness, etc.

Waterborne Disease  Outbreaks
Weibel.  et al.8)

Despite such limitations, Weibel, et al, were able to compile diseases
by identity finding that the number of reported epidemics and cases
of waterborne disease outbreaks in the United States between 1946 and
I960 were primarily classified as  "gastroenteritis".   (See Table  1.)
"This category and 'diarrhea' (16 outbreaks and 5,160 cases) are  not
specific  diseases, because the etiologic agent has not been determined
through laboratory analysis.   Even with laboratory analysis, the
causative agent cannot always be determined.")
7)    S. Krishnaswami, p. 2261.

8)    S. R. Weibel,  F.  R. Dixon, R. B.  Weidner, and L. J. McCabe,
      "Waterborne Disease Outbreaks,  1946-60", Journal of the
      American Waterworks Association,  56:-949 (August 1964).

9)    S. Krishnaswami, p. 948.
                                 8

-------
                                  TABLE 1
                                           10)
            Waterborne-Disease Outbreaks in the United States,




                      1946-60, by Type of Illness and System
nine s s
Private or Semi-
public Systems
Outbreaks
Cases
Public Utilities
Outbreaks
Cases
All Systems
Outbreaks
Cases
Gastroenteritis




Typhoid




Infectious hepatitis




Diarrhea




Shigellosis




Salmonellosis




Amebiasis
Other
              Total
92
33
14
7
4
3
2
3
158
4,233
403
430
320
596
22
36
16
6, 056
3411J
6
9
9
7
1
0
4
70
9, 397
103
500
4,840
5,057
2
0
29
19,928
126
39
23
16
11
4
2
7
228
13,630
506
930
5, 160
5,653
24
36
45
25,984
 10)   Weibel, et al.
 11)   One gastroenteritis outbreak also included a typhoid case.

-------
Weibel, et al also report on a review of waterborne-disease outbreaks in
the United States and Canada by Gorman and Wolman/^'covering the period
from 1920 to 1936, in which the case distribution (116, 000) by type of ill-
ness was:  88 percent diarrhea, 11 percent typhoid fever,  and 1 percent
dysentery.  A second review by Eliassen and Cummings  'covered water-
borne outbreaks in the United States between 1938 and  1945 with a case
illness distribution (110,000) of:  91 percent gastroenteritis (which prob-
ably corresponds to the "diarrhea" classification of the earlier study); 8
percent dysentery? and,  1 percent typhoid.

The Eliassen and Cummings report used the term "outbreak" to refer to
two or more cases; the Gorman and Wolman report defined it as consist-
ing of five or more cases.

Weibel, et al also show a distribution of outbreaks by month for the years
1946-60^ which indicate  a peaking of outbreaks due to private supplies in
the summer months. (See Table  2. )14)   This is of extrinsic interest to. our
research in that our stress is rather on usage of school absenteeism, data, covering
the time periods of September to  June.  It is a finding  which is given
corroboration by more recent analyses done by Petersen and Hines,  which
will be  reported on in a later section of this report.

Unfortunately, the o~nly explanation offered with regard to causal factors of such
occurrence is either greater pollution during the summer months,  or pos-
sibly increased usage by more susceptible individuals  (tourists,  campers,
travelers, etc. ).

G. Craun, L. McCabe15)

During 1961-1970, Craun and McCabe reviewed 128 known outbreaks of
disease or poisoning attributed to drinking water, with 46, 369 illnesses
and 20 deaths.  Their definition of "outbreak" reflected at least two cases
of infectious disease, associated  with water used for drinking or domestic
   12)   Gorman, A. C. & Wolman,  A. ,  "Waterborne Outbreaks in the
        United States and Their Significance.  Journal of the American
        Water Works Association 31:222 (February 1939).

   13)   Eliassen,  R.  8t Cummings,  R. H. "Analysis of Waterborne Out-
        breaks,  1938-45" Journal AWWA, 40:509 (May 1948).

   14)   Weibel,  et al, p.  954.

   15)   G.  Craun and L. McCabe, Ibid.
                                    10

-------
                                 TABLE 2
                                           16)
              Seasonal Distribution of Waterborne-Disease

               Outbreaks in the United States,  1946-60, by

                 Type of System and Affected Population



Month



January
February.
March
April
May
June
July
August
September
October
November
December
Total
Month
unknown
Grand
total


Public
Utilities
%


7.6
10.6
7.6
7.6
6.1
3.0
12.1
9.1
9.1
6.1
9.1
12.0
100.0 .




Private or Semipublic
Systems


Percent-
age of
Total
3.9
3.9
2.6
5.9
6.6
12.5
29.0
15.8
8.6
6.6
2.0
2.6
100.0





Affected Population


1*
3
4
3
4
3
7
10
4
4
6
3
1
52

0

52
2f
1
0
0
1
2
2
0
1
1
3
0
1
12

3

15
3+
2
1
1
4
5
10
34
19
8
1
0
2
87

2

89
4§

1










1

1

2
 *Habitual users of supply.
 •^School children.
 ^Visitors.
 § Unclassified
16) S. R.  Weibel,  et al
                                     11

-------
purposes.  Data sources included state health departments,
medical and engineering literature, newspaper clippings,  polls of state
sanitary engineers and epidemiologists.

Their findings concluded that on the average, one waterborne outbreak
that is known about occurred per month with something over 100 persons
becoming ill.  Some of the illness was quite severe and about two deaths
per year occurred from waterborne outbreaks.

There were 94 outbreaks in private supplies and 34 outbreaks in public
supplies.   Most of the outbreaks were classified as gastroenteritis; these
include 38 outbreaks and 25, 800 cases.

Viruses in Water Supply

Dr.  Luther L.  Terry, U. S. Surgeon General, was quoted in '62 as saying,
"We are by no means  sure that at least some viruses are  not slipping
through our present water purification  and disinfection processes and en-
tering our water mains".  '

Jh reporting on the 13th Water Quality Conference,  McDermott  'summa-
rized the  reports on virus and water quality as follows:

         1. Most speakers agree that there is a growing  concern with
         the problem of viral pollution of water in this country.

         2. It was generally agreed that,  because the basic properties
         of viruses are poorly understood, their transmission by the
         water route has not been fully appreciated.  However,  sufficient
         evidence has been presented to support the thesis that virus  can
         be transmitted by water and result in human infection and disease.

         3. The speakers unanimously agreed that there  is insufficient
         technical data now at hand to document conclusively the idea that
         virus-free water can be attained  with our present wastewater
         treatment and domestic water supply treatment processes and
         practices.
  17)   Quoted in Report of Committee on Recreation Use of Sources of
       Public Water Supply,  N. Eng. Water Works Association, Journal
       New England Water Works Association,  85:1:92 (1971).

  18)   J.  H.  McDermott, Ibid
                                     12

-------
        4.  Thus, immediate attention should be given to additional studies
        and investigations in the following subject areas:  viral detection
        and enumeration methodology, basic properties of enteric virus,
        transmission through the aquatic environment, viral disease of
        man and associated epidemiological studies, and unit process re-
        search and development.

In a discussion of viral disease  and epidemiology McDermott reports  that
we can reasonably be assured that the transmission of human enteric
diseases are largely dependent  upon water as the vehicle.

Metcalf and Melnick  presented papers at the conference indicating that one
strain of virus may produce  illnesses with widely variable incubation  periods
and manifestations,  e. g.,  a  coxsackievirus may produce meningitis in one
person, myocarditis in another, and diarrhea in a third, etc.  ''

S. F. B. Poynter20)

S. F. B. Poynter,  Senior Bacteriologist, Metropolitan Water Board,
addresse s this same area of concern:  Infection may be brought about by
ingestion of virus contaminated food or water or by  downward spread from
the respiratory tract.   The prolonged excretion of the virus in the faeces
which follows infection causes faecal contamination  to be a major factor  in
their spread,  and probably most enterovirus epidemics are started by
symptomless excreters.

Poynter also specifically indicated that Adenoviruses are aetiologically
associated with upper  respiratory tract infections.
James W.  Mosley21)

Mosley attempted to classify how often water is responsible for transmission
of viruses responsible for infectious disease.  He reported that infectious
  19)   T. G. Metcalf, "Biologic Parameters in Water Transmission of
       Viruses;" J. L. Melnick,  "Detection of Virus Spread by the Water
       Route, " Proceedings of the  13th Water Quality Conference,  Univer-
       sity of Illinois at Urbana-Champaign (1971).

  20)   S.  F. B. Poynter,  "The Problem of  Viruses in Water",  Water
       Treatment and Examination,  17:187 (1968).

  21)   Mosley,  James, Ibi d.
                                    •13

-------
hepatitis is the only disease caused by an agent having the characteristics
of a virus, for which evidence of waterborne transmission has been accept-
ed by all workers in the field.  Nonetheless,  he  considers relatively
neglected the entities known as viral gastroenteritis and diarrhea,  although
numerous waterborne epidemics are documented.

He points out that the limitations of using the term "viral" to apply to gas-
troenteritis or diarrhea are that it provides a wastebasket for those episodes
due to agents other  than viruses not yet identified  as pathogenic, as well as
for those instances  in which the search for known  pathogens failed because
of delay in the  investigation or technical inadequacies in the laboratory.

         "Despite these limitations, experimental studies have in-
         dicated  that forms of gastroenteritis and  diarrhea trans-
         missible from person-to-person are due  to agents with
         characteristics of viruses (Gordon et al,   1949; Jordan
         et al,   1953; Reimann, 1963).   It  seems  plausible, there-
         fore,  that under some circumstances such agents  could be
         waterborne.   In addition, it has been possible to associate
         epidemics  of gastroenteritis and diarrheal disease in new-
         borns with infection by enteroviruses (Eichenwald et al»
         1958; Lepine et al,   I960).  The extent to which enteroviruses
         are responsible for these  syndromes in older  children and
         adults is less clear.   Echovirus 11,  however, has been re-
         ported to have caused gastroenteritis in  adults working with
         this agent  in the laboratory (Cramblett et_al»    1962; Klein et
         al,  I960). No association of enteroviruses with waterborne
         episodes has been achieved, but it is uncertain whether any
         serious attempts have been made  to do so. "

Finally, Mosley  indicates that evidence for waterborne  transmission of
viral disease has been based primarily upon epidemics  in which cases
occurred within  a sufficiently short period of time to make person-to-
person transmission an unlikely explanation.

There is "feeling" at this time that the role of water transmission of viruses
is, therefore,  especially related to municipal systems using surface water.
                                    14

-------
O.  C.  Liu*

In a personal communication,  Dr. Liu addressed the question of dollar
costs associated with occurrence of viral illnesses. He  believes that from
available epidemiologic data, virus is perhaps the most important water
pollutant which could produce harmful effects in man.

Pointing out that water may be polluted by biologic,  chemical and radio-
active pollutants, Liu suggests that by eliminating these pollutants,
human health may be benefitted and these benefits may be converted into
dollar values.  Health benefits are enumerated as either dollars saved
for medical expenses as if the patient had not been sick and/or the
dollars which could be  earned in one's lifetime if the patient  dies.

Liu groups diseases produced by enteric viruses into two major
categories:

        a)  Acute clinical illness
        b)  Health effects from subclinical infections

The subclinical infections are of primary concern to us in this study.
Liu reports that subclinical infections of  enteric viruses in man, repre-
senting 100-1000 times the incidence of clinical infections, are far from
harmless.   Certain numbers of enteric viruses have been associated with
serious delayed  effects.

He states that the whole group of enteric  viruses may potentially be trans-
mitted by water through:  a) drinking and culinary use; b) recreation; c)
agriculture; and d) food.
     *  - O. C.  Liu,  M. D.  Personal Communication,  Office of
        Research and Monitoring, Environmental Protection
        Agency,  Narragansett, Rhode Island (1972).
                                     15

-------
With the present knowledge, therefore, he estimates that health benefits
could be of two extremes:

(a)  One extreme is represented by use of the published data oij waterborne
disease incidence as a basis for the health benefit estimate.  Practically,
these include only two diseases:  infectious hepatitis, and gastroenteritis or
diarrheal disease.  But he personally feels that the reported figures (Mosley,
Wiebel, et  al, etc.),  probably represent  only a fraction of those which
actually occurred.  "If we  estimate  health benefits according to the above,
the total value will be too low  to be realistic or true. "

(b)  The other extreme is to estimate the benefits by assuming that eradica-
tion of all enteric virus diseases incidence may be achieved if  the water
pollution from virus is under  control.  Liu feels there is circumstantial
evidence to indicate that the latter might  be the case.  "Intensive research,
however, is badly needed in this  area in order to definitely identify this
problem.  The long and insidious route of transmission by water is not at
all understood at present.  Water,  however, may serve as an essential
link in the chain of events in perpetuating these viruses.  Once this link
is broken,  the virus finally may stop to exist among human hosts thereby
eradicating these diseases. "

Liu gives his estimated dollar value which may be saved per annum,  if
indeed eradication of enteric virus diseases in this  country can be
accomplished by control of water pollution.  For acute gastroenteritis
and diarrheal diseases (2,000, 000 work days lost/year at $30/day) the
rough estimated figure is $60 million.

Health fa Mineral Characteristics of Water Supplies

Neri, Hewitt and Schreiber^Z)

In a recent review for the  Journal of Epidemiology,  Neri,  et al present
a synopsis  of findings on literature  investigating relationships  between
health and mineral characteristics of local water supplies. They
remark on  the general impression resulting from the literature reviewed,
(53 journal articles and papers) "that some 'water factor' does exist,  but
no agreement is in sight concerning the identity of this factor,  its likely
mode of action or even the pathological effects produced by it".
22) Luciano C. Neri, David Hewitt and George B. Schreiber, "Can
    Epidemiology Elucidate the Water Story?" Journal of Epidemiology
    99:2, February 1974, 75-88.

23) A comprehensive review of the literature is contained in Masironi, R. ,
    Miesch, A. T.,  Crawford, M. D., et al,  "Geochemical Environments,
    Trace Elements & Cardiovascular Diseases, " (forthcoming).  Another
    aspect of water mineralization damages can be found in Tihansky,
   ' Dennis,  "Economic Damages from Residential Use of Mineralized
    Water Supply", Water Resources Research, April 1974.
                                     16

-------
An overview of the findings from this literature review indicates that water
quality relates to cardiovascular disease, or some subdivision of it.  But as
Neri, et al point out, the support for this finding is not yet even very strong.
In fact,  "'it has been progressively weakened by the admission of bronchitis
'effect1,   '  by the suggestion that there is also an infant mortality 'effect1
and by-the estimate based on Canadian statistics that more than half the ex-
cess mortality in soft water areas  is certified to noncardiovascular cause
of death.26)


In their review, Neri, et al emphasize the  need for an increase in epidemi-
ological studies conducted in a systematic fashion, with due regard to the
need for specifying and testing explicit hypotheses.   They recommend studies
which utilize a method such as partial correlation analysis.  Their stress on
more rigorous analyses in the field of epidemiology is illustrative of a weak-
ness inherent in research - no matter what the discipline - which accents
descriptive and anecdotal criteria in the achievement of results.

Coliform and Pathogenic Organisms

T. Virarghavan2 "   indicates that in studies of water quality, "The coliform
organisms,  because of their large numbers in fecal matter,  offer a far more
satisfactory approach to the detection of fecal contamination than do the
pathogenic  organisms.  A search for pathogenic organisms  is very difficult
and time consuming and may prove fruitless when fecal contamination is
present but specific pathogens are absent".  He further points out that the
water utility profession relies on  coliform examination to determine whether
water is free from disease  producing microorganisms.
24)     Crawford, M. D. , Garner, M.  J. ,  Morris, J. N.  "Mortality &:
        Hardness of Local Water Supplies, " Lancet 1:827-831,  1968.

25)     Lowe, C. R., Roberts,  C. J. ,  Lloyd S.,  "Malformations of
        Central  Nervous System & Softness of Local Water Supplies, "
        British Medical Journal, 2:357-361, 1971; and Crawford, M.  D. ,
        Gardner, M. J.,  Sedgwick, P.  A. , "Infant Mortality and Hardness
        of Local Water Supplies".  Lancet, 1:988-992, 1972.

26)     Neri,  L. C. , Mandel, J. S. ,  Hewitt,  D.,  "Relation Between  Mortality
        & Water Hardness in Canada", Lancet 1:931-934, 1972.

27)     T.  Viraraghavan,  "Water  Quality & Health" Journal of the American
        Water Works Association (October 1973)  pp. 647^o50.
                                       17

-------
Virarghavan makes the statement that although pathogenic organisms may
survive water treatment that apparently removes all coliform bacteria,
the evidence available from outbreaks of waterborne disease  from treated
public water supplies shows that the coliform examination has been a satis-
factory measure of the microbial safety of water.

In this same regard, Krishnaswamir  states, "There is no evidence that
bacterial,  enteroviral and other microbial diseases are frequently trans-
mitted to man by water that meets the relatively stringent bacterial, dis-
infection, and chlorine residual standards  for drinking water".

Bathing and Recreational Water

It is of interest to  note that while uniformity exists for bacteriological
standards of drinking water in the United States, most of the  states have a
variety of bacteriological standards for bathing water.

The most pertinent study completed in the  bathing water quality and health
area was carried out by Stevenson prior to 1952.^°'  The goals of the r.esearch
were to determine what frequency of  swimming might be expected from the
population groups chosen; to determine what relative increase in illness in-
cidence might be expected  in the cleanest waters; and to determine what
differences in illness incidence might be expected from swimming in water
containing various degrees of bacterial pollution.

The studies were conducted over two months with families  living near   1)
a great lake; 2) an inland river and a fresh water recirculating pool, and
  3) a tidal water.   Stevenson's  overall findings were the following:

     a)  An appreciably higher overall illness incidence may be ex-
        pected in the swimming group over that in the non-swimming
        group.

     b)  When the total  illnesses among swimmers and non-swimmers
        were compared,  there appeared to be no significant correlation
        between the illness incidence and the quality of water in the
        areas studied.

     c)  Some 100 percent higher illness  in the group under 10 years of
        age was shown than for those over  10 years of age.

     d)  Among  swimmers, eye, ear, nose  and throat ailments represented
        more than 50 percent of the illnesses reported; gastrointestinal
        disturbances about 20 percent and  skin irritations the remainder.
28)  Krishnaswami,  p.  2262

29)  Stevenson, A. H. ,  "Studies of Bathing Water Quality & Health",
     American Journal of Public Health,  43:5, 1953, 529-538.
                                       18

-------
     e)  Specific correlation between illness incidence and bathing
        in waters  of a particular bacterial quality was observed in
        two instances.  Illness frequency was significantly higher
        among swimmers when the average total coliform density
        was up to 2700 per MPN rather than below 43 percent MPN.

Stevenson was particularly cautious about the latter finding as only three days
were on each side of the comparison; plus, thorough analysis of the effect
that weather might have had on the  result was not permitted.

McKee indicated findings which showed that sewage-polluted sea water
carried only negligible risk to health unless the water was "highly" polluted.

What remains unknown is the  generalizability of these findings  across time
and place.  Research in water recreation and health has received particu-
larly little attention, due to the difficulties inherent in carrying out a valid
study (i.e.,  data collection problems, costs involved in a study of statis-
tically significant proportions, difficulties with data multi-collinearity,
etc.).  Nonetheless,  the findings  described above, as well as other studies
•which reveal rising illness in the summer months, indicate the need for
program emphasis in this research area.

Summertime Gastrointestinal  Illness

Petersen and Hines carried out a study on the relationships between summertime
gastrointestinal illness and the sanitary quality of water supplies in six rocky
mountain communities. 31)  The research performed is notable  for its  employment
of rigorous statistical analyses.  Additionally, Petersen and Hines identify their
analytical methods and data sources.

Their  query examines existence of  a correlation between unreported gastro-
intestinal illness and the populations using the water supplies.  Approximately
100 families in six Rocky Mountain communities were surveyed.  No schools
were in session during the  period.

Variables included: family experience with gastrointestinal illness during June,
July and August of 1957.  (Gastrointestinal illness was  defined as diarrhea with
or without nausea and vomiting, persisting for at least one day); and bacterio-
logical water analysis for each town over the  past 3 years.

Illness experienced was obtained  by means of survey on a house-to-house basis
by 2 investigators, using a standardized form, directed to one  responsible
household person, usually the mother.  Data were recorded on McBee cards
specifically designed for gastrointestinal illness. Table compilations  were
made up of the observations.
30) McKee,  J.  C. and Wolf, H. W. "Water Quality Criteria",  California
    State Water Quality Board Publication No. 3-A,  1963.

31) "The Relation of Summertime Gastrointestinal Illness to the Sanitary
    Quality of the Water Supplies in Six Rocky Mountain Communities",
    American Journal of Hygiene,  Vol. 71:314-3ZO,  November 1959.
                                      19

-------
Bacteriological water analysis was taken from the office records of the State
Health Department.   Numerical values were assigned based on a demon-
strated consistency of quality over a three year period.

Data manipulations were conducted by means of simple chi square tests of
epidemiological factors  associated with the illnesses showing significant
differences between the  group of communities with water of low sanitary
quality and those in the group having high sanitary quality.

Their findings  concluded that no patterns could be found from analysis of
chronological arrangement of cases of illness in each community by date
of onset, which could be interpreted as being the result  of a common source,
single exposure type  of infection.  Only 20 of the 206 cases showed a history
of eating away from home during the week preceding illness.   No correlation
was found between incidence of illness and particular brands of milk.

No correlation was found to environmental sanitation deficiencies, because,
of 570 premises surveyed, all had running water inside  the house, and only
three had sewage  disposal of a type other than an indoor, water-flush system.

Communities with water supplies of low  sanitary quality were  found to have
an overall attack rate of 13. 9 percent, while communities with water of high
sanitary quality had an overall attack rate of 8. 8 percent.  This correlation
between water quality and attack rate was found consistently when computed
separately for each community.

The attack rate for males was higher than the rate for females in both groups
of communities with  no significant difference.  A concentration of illness
among the young appeared to  be greater  in communities with water supplies
of high  sanitary quality.  Attack rates computed by family size indicated no
consistent gradation. The percentage of cases occurring in July and August
indicated a definite increase as the summer progressed, for both groups of
c ommuniti e s.

Interestingly,  an overall attack rate among people who had lived in the
communities with low sanitary quality water supplies for less  than two years,
•was twice that of the people who had lived  in the communities more than two
years.

Benefits from  Reduction of Waterborne Disease Outbreaks

Jack Lackner did  the only cost-benefit study we found at all attributable to
•water quality effects  on some low  level illness.   '   He  queried the extent
to which benefits from anticipated reduction in waterborne  disease outbreaks
and health damage can be  predicted!  His point of reference •was the Safe
Drinking Water Act of 1973.
32)  "Safe Drinking Water Act of 1973:  Estimated Benefits and Costs:,
     EPA Office of Planning and Evaluation.
                                     20

-------
 He examined disease data in the United States,  1961-1970; flourida-
 tion practices in the United States (1967 capital investment data con-
 sidered in relation to 160 million people served); and, future water
 supply system upgrading (costs for 1600 supplies).

 Data and sources were:  1)  Disease incidence from Morbidity and
 Mortality, annual supplement,  Summary 1970, USDHEW, Center for
 Disease Control;  2)  Bacteriological data,  effectiveness of flourida-
 tion, and lead excess data from the "Community Water Supply Study ";
 3)  Capital investment data  in public water  supply facilities from 1967
 Department of Commerce data and;  4)  Department of Commerce and
 the American Water Works  Association estimates on future costs of
 upgrading water supply systems to comply  to 1962 PHS Drinking Water
 Standards.

 Data manipulations included the following:  1)  Judgmental estimates
 were made by Public Health Scientists that 1/3 or approximately
 245, 000 of reported disease cases  are in fact waterborne. An assump-
 tion that the recording of cases of waterborne'disease is about 10 per-
 cent of actual number of cases of waterborne disease, increased the
 number to 2,450, 000 in the 10 year period.  2) Estimation of in-
 come at $25 per day and hospitalization at $200 per day.  3)  Assump-
 tion of 1,000,000  cases of drinking water gastroenteritis per year at
 comparable costs of $50 per case.   4)  Correction  of poor flouridation
. using "Community Water Supply Study" indicator of 38 percent of the
 86, 136,000  people receiving ineffective levels of flouridation as it
 pertains to dental disease.  5) Capital investment data in public water
 supply facilities considered in relation to the 160 million people served
 at $312 per  capita.  6)  Assumption of three chlorinators needed per
 system for $24, 000  systems at a cost of $3, 000 each. 7) Community
 water supply figures indicating that 1,000 supplies  need defluoridation
 equipment,  and 600  supplies need correction of failure to meet  constituent
 standard of lead.

 Lackner's findings included a total for estimated annual benefits of
 $810, 000, 000.  This figure was derived from the $306 million attribut-
 able  to reductions in communicable diseases; $50 million attributable
 to reductions in the  incidence of gastroenteritis; $454 million attribut-
 able  to reductions in dental decay.

 DOC and AWWA data estimated that $6 billion should be spent at a rate
 of $500 million per year over a 15 year period for improvement of water
 supply facilities.  The yearly cost of program to correct inadequate
 design, construction, operation and maintenance of water supply facili-
 ties, was  estimated at $2. 5 billion.  The direct Federal cost of imple-
 menting legislation on water quality was estimated  at $27 million in
 FY '74, increasing to $63 million in FY '78; estimated cost to states
 was $50 million each year;  estimated costs to communities was $165
 million annually.  The additional future costs required to upgrade all
 water supply systems was estimated at $6  billion, to be spent at the rate
 of $500 million per year.

-------
                             Summary
Overall, the Petersen and Hines morbidity study revealed a signifi-
cant relationship between the sanitary quality of a community's water
supply and the incidence of gastrointestinal illness in that community.
Communities served by contaminated water suffered an attack rate
of 14 percent during the summer months as opposed to the 9 percent
attack rate for people  using water supplies showing no contamination.

The latter finding corresponds to the Weibel,  et al, peaking of water-
borne disease outbreaks due to private supplies,  in the summer
months.

Lackner's analysis resulted from studies showing that 30 million
people, served by 8,000  community water supply systems, were re-
ceiving inadequate flouridation in their water for maximum benefit
to their dental health.   Furthermore, 64. 4 million people,  served by
approximately  16, 000  systems, were receiving water containing
dentally insignificant flouride ion levels.  His cost benefit study gives
rough estimates of monetary and health savings derivative from effec-
tive flouridation levels, which are calculated to be in the millions.  No
other researcher to our knowledge has published findings comparably
achieved.

Of the literature reviewed relevant to water quality and low level ill-
ness, the above two studies were the most analytically rigorous, and
empirically definitive.

More precise analyses have been undertaken in the area of air pollu-
tion and health, several of which are  quantifiably verifiable.  Partic-
ularly notable is work done by Ridker 33) on morbidity, mortality
and respiratory disease, and the subsequent studies by Lave and Seskin
34) which expanded the diseases covered by Ridker.  Ridker estimates
that 18 to 20 percent of the approximately $2 billion in national health
costs results from air pollution.  Lave  and Seskin, in a conclusion
which they consider to be conservative, estimate that air pollution
damage amounts to 4. 5 percent of all the economic damage associated
with morbidity and mortality.

Similar findings are being reported which specifically indicate relation-
ships between air pollution and serious  illness.  For example,  a recent
report by Henderson,  Menck and Casagrande of the University of
Southern California  indicates that residents of South Central Los Angeles,
who have  for years lived near oil refineries and chemical plants, are
more likely to  die of lung cancer than people who reside elsewhere.
33)  Ridker, Ronald G., Economic Costs of Air Pollution.
     New York, Frederick A. Praeger,  1967.
34)  Lave,  L. B. and E. P. Seskin.  "Air Pollution & Human
     Health", Science 169(3947):  723-733, August 21, 1970.
                                 22

-------
In a study which specifically addressed air pollution vis-a-vis low
level illness,  residence data again proved of consequence:  Chapman,
et al, queried to what extent air pollution exposures related to the
prevalence of chronic respiratory disease.  35) Using data from
New York, findings suggested that moving from a polluted area to
a clean one can promote substantial improvement  in symptoms.

Other evidence from the same study suggested that sulphur oxide
pollution exerts an effect on chronic respiratory disease which may
rival the effect of cigarette smoking,  particularly in areas where
high exposures to sulphur oxide are coupled with suspended
particulates.

A final air pollution/low level  illness study by Thompson,  et al,  in-
vestigated the effects of air pollution on the common cold. 3°)Using
multiple regression it was found that statistical analyses  differed
markedly by season of the year and by whether incidence  or common
cold was being examined.  Meteorologic variables used appeared to
be more related to the common cold rates than the pollutant variables,
but examination of the data failed to  "explain" the  reasons underlying
this finding.

Overview of Findings
While highly definitive  findings examining water pollution relationships to
low level illness are still in the future, of the studies conducted to date, the
evidence points to probable significant relationships.  An overview of find-
ings from the literature reviewed can be summarized as  follows:

•   The role of water as a causal factor in occurrence of low level
    illness has not received serious consideration by scientists.
    This results  partially from the fact that in both water and air
    pollution studies, the role of environmental pollutants  in
    mortality and chronic disease is difficult to quantify because
    so many other determinants of death and disease cannot be
    adequately measured.  A  simple relationship of causal specificity
    does not always exist.

    Studies of waterborne illness covering the years  1920-1936, and
     1946-1960 have classified over 80 percent of  the cases as gas-
    troenteritis or  diarrheal illness.  While not specific diseases
    because no etiologic agent can be identified, these categories
    of illness account for the  vast majority of American illness due
    to waterborne sources.
35) Chapman, Robert S. et al, "Chronic Respiratory Disease Symptom
    Frequency in Late Adolescence, In Parents of School Children and
    in Elderly Persons",  Presented at the American Medical Associa-
    tion's Air Pollution Medical Research Conference, Chicago,  111. ,
    October 1972.
36) Thompson,  et al, "Health and the Urban Environment: Air Pollu-
    tion,  Weather and the Common Cold", Journal of Public  Health,
    Vol.  60,  No.  4(1970), 731-739.
                                 23

-------
Additional studies are needed in the area of viral detection, enumera-
tion, transmission,  and elimination because their basic properties
are poorly understood.


There is sufficient evidence that virus can be transmitted by water
and result in human infection.  However,  intensified research has
yet to prove conclusively whether bacterial, enteroviral and other
microbial diseases are frequently transmitted to man by water
that meets the bacterial, disinfection and chlorine residual stand-
ards for drinking water.

Past evidence for waterborne transmission of viral  disease has
been based primarily upon epidemics in which cases occurred
within a sufficiently short period of time to make person-to-
person transmission an unlikely explanation.

There is evidence that viruses may produce widely variable
incubation periods and manifestations in different individuals.

Some sources indicate that the role of water transmission of viruses
is especially related to municipal systems using surface  water.

Although present water purification technology can handle all
significant bacterial problems,  in practice, available technology
is not always  applied.  Many water systems are overage, sub-
standard and unable to meet peak demands.

Some morbidity studies  reveal a significant relationship between the
sanitary quality  of a community's water supply and  the incidence
of gastrointestinal illness.

Research is needed in the area of bathing and recreational water
to study indications that swimmers suffer from eye, ear, nose,
throat ailments, and gastrointestinal disturbances more  often
than non-swimmers.

Two studies have shown a peaking of waterborne illness in the
summer months, particularly as this season progresses.

Residence data in recent studies indicate that longevity in a
polluted area contributes to symptoms of serious  diseases.
However,  an overall illness attack rate among people who have
lived in communities for shorter periods of time can be observed
as being higher than for people who have lived in communities for
longer  periods (for  example, more than two years).

One estimate states that $60 million can be saved per annum through
eradication of gastroenteritis and diarrheal disease from water
pollution.
                                 24

-------
                           SECTION  V
                        RESEARCH DESIGN

The Problem

      The driving question of this research project developed from the CWSS
study of drinking water around the nation.  If, indeed, drinking water is not
uniformly good and water treatment standards are not consistently followed,
does this have any health implications with economic effects?

      There is an image that our drinking water is pure and that there are
no serious health problems resulting from it.   It is now apparent that the
first half of this image is shattered.  Yet there are large numbe rs of
epidemologists who would still argue that there are no  serious health effects.
They will point out that there are very few known outbreaks  of epidemics
which can, with certainty, be traced to drinking water.  They will argue
that we produce anti-bodies to protect  us from most bacteria so  that even
if our drinking -water is poor, we will develop immunity to most diseases
it could produce. Mexico is indicated as a prime example of a country with
contaminated drinking water which presumably does not affect the health
of the residents.

      To a very large extent, these critics are correct. Indeed there have
been  isolated known water-related epidemics.  We do develop
effective anti-bodies and the water probably could get considerably worse
before epidemics of major serious diseases occur.

      Despite this,  the critics may be missing a very important point.
People do not have to contract a major disease to get sick.  Unreported
sub-clinical illness may not be fatal, but it is unpleasant and causes lost
work and occasional medical costs.  And secondly, the absence of epidemics
does not rule out the possibility that the drinking water may carry micro-
organisms which do, in fact, result in  illness at a sub-epidemic  level.

      As we have said  above, the over-whelming majority of health related
absenteeism comes not from the  "serious" diseases  such as cancer,  heart
attacks, etc. but from sub-clinical illnesses such as flu, upset stomachs,
and colds.

      We  believe that if our drinking -water has any appreciable economic
effect on the community, it will be through low level illness. Variations
from community to community  in this type of illness  could easily be
significant yet go unnoticed.
                                    25

-------
Data

      Given the above general hypotheses,  it is necessary for us to
operationalize  the concepts and to  postulate  a set of more specific data-
based hypotheses.  In doing this it is useful to describe the data which we
would ideally want in order to best test our general  hypotheses  (not that
we could get it, because we always have to make some concessions to the
reality of availability.) Knowing what we would want, however,  enables  us
to better understand the characteristics of the results which we obtain
with real  data.  Ideally, to perform a study with these goals, we would
like  a good measure of adult illness which would reasonably be  related to
drinking water.  Ideally again,  this would include  only those sub-clinical
illnesses  whose virus or bacteria can be transmitted by  drinking water.
We would want a  good measure  of water quality including average coliform
counts for a year, and the maximum coliform count for a period perhaps
as long as a month.

      Additionally, we would want measures  of community socio-economic
characteristics such as average income, population density, racial
distribution, and housing characteristics.  Finally, we would want  to hi ve
average and extreme values for particulate counts and sulfur oxides.

Absenteeism Data

      Obtaining direct measures of adult sub-clinical illness is virtually
impossible  without extensive surveying.  The problem is that there is no
clear measure of low level illness by community for those communities
which we  also  have water  quality data. The National Health Survey is an
excellent measure of illness  of all types, but it is a nationwide  sample
which cannot reasonably be broken down to the small communities in which
we are  interested.

      People generally do not report low-level illnesses unless they are
particularly severe or prolonged.  Thus, doctor's records, hospital
records and public health  records are  quite  inadequate for our purposes.

      Since we are primarily concerned with finding the economic costs
associated  with inadequate drinking water, it is reasonable  to consider
only those incidents  of illness which resulted in absenteeism.
                                 26

-------
      Although this simplified the task of measuring community health
to some extent, it clearly was not without problems.  Direct measures
of adult absenteeism are difficult to gather and are of questionable
value even where available. As  we shall demonstrate, indirect measures
appear to be superior.  First, however, let us deal with the problem of
direct measures.

      Any community of 5, 000 or more is large enough to have a reason-
ably diverse set of business enterprises.  There may be, for example, one
or two medium-sized factories,  a few small manufacturing  concerns, an
array of retail establishments,  a small number of construction firms,
some government officials, and perhaps some mining or agriculture-related
businesses.

      Records of absenteeism are not kept uniformly. Some companies
have detailed records,  other are very incomplete.  The variation occurs
along at least two dimensions:   size of business and style of management.
Normally large concerns keep better  records than small businesses as a
matter of necessity.  Furthermore, just as a matter of personal style,
some managers simply keep better books  than do others.   This variation
is unknown,  immeasurable, and  not necessarily random.  Thus, even
given some acceptable  sampling of employees in a community, it is
questionable whether the measure of absenteeism would even reflect the
number of days for which work  was missed.  Even if the actual number of
days missed were accurately estimated,  there are a number of intervening
variables which would make the  interpretation of these data of questionable
value.  Occupation-related illness, variations in paid sick leave,  insurance
policies,  commuters who do not live in the cities where they work,  etc. , all
pose problems which would be extremely  cumbersome  to overcome.

      Given these problems, we chose to  select a different measure of
absenteeism which,  although not perfect,  has fewer difficulties:^lernentary
school absenteeism.  This  measure has a  number of characteristics which
make it a reasonably good measure of water-related absenteeism.

      It is such a large proportion of the poulation of elementary  school age
children, that the measure can  be considered to be the population reflecting
the socio-cultural and economic variations of the communities within
which they reside.
                                   27

-------
      Unfortunately, the purity of population is often somewhat diminished
by the presence of students bussed in from smaller  rural communities.
Obviously the water which these "outsiders" drink is different from that of
the local children. Although complete compensation for the problem cannot
be achieved given the resources available to us,  there are three factors
which tend to relieve the problem.  First,  the selection of elementary
schools (rather than middle  or high schools) reduces the problem con-
siderably.  School centralization is far less  frequent for elementary schools
than for higher grade levels. During  the course of this contract we were
pleased (for scientific reasons) to find that large numbers of small
communities will have their own very small elementary schools rather than
selecting to send their children to larger  cities a few miles away. Secondly,
although when they are at home, children from outside a school district are
exposed to drinking water  of possibly different quality than was measured
by the CWSS study, when they are at  school, they will drink the same water
as other students.

      Finally, in one state (Vermont), the records of children who are bussed
in from outside the school district are kept separate from those within the
school district. Thus, in this state,  the records available to us reflected
primarily only those children who actually lived in the  school district.
Since the  CWSS study recorded all public water systems in each town visited,
this means  that for Vermont, we have a high confidence that the student
population is nearly entirely drawn from within the  water system  boundaries.-  ''

      A second advantage  of elementary school  data over its adult counterpart
is that occupational-related  absentees simply are not there. While variations
in school health programs may produce some variations in absenteeism, the
problem cannot compare with the variations that surround different occupa-
tions.

      A third advantage is that there  is no equivalent to paid  sick-leave
days. The adage that  "sick days taken expands to meet sick days allotted"
is not applicable.

      A fourth advantage of  elementary school data  (over higher grades)
is that unexcused absence is not as prevalent.  Certainly it  exists, but
the overwhelming number of educators consulted expressed the opinion
that xinexcused absenteeism is more  severe  in the high schools than the
elementary schools.
 37)  Since we were able to analyze data at the state level,  it will be
     seen later on in this report that we used Vermont as a check for
     those areas where the data were aggregated only by school district.
                                   .28

-------
       Finally, although there are a number of socio-economic factors
 underlying much of the absenteeism including both excused and unexcused
 absences these are systematic and analyzable.  The economic state of the
 community, housing characteristics,  etc. , can all be engaged to account
 for some of the  variation unrelated to water.

       Probably the most widely raised question about the use of school
 absenteeism data has been the fact that we do not know what illnesses
 caused children to miss school. The child with the broken arm is recorded
 exactly the same as is the child with gastroenteritis.   The former obviously
 is not water related - the latter may well be.  How,  our critics will argue,
 can we possibly make  statements about the health effects of poor drinking
 water when we have no idea what illnesses the children had?  The problem
 is resolvable given one assumption:  that  controlling for all other systematic
 influences on health,  non-water related illnesses occur  randomly. That is, once
 we account for all systematic effects from economic well being,  air pollution,
 urbanization,  etc.8  the remaining illness is distributed randomly except
 for water-related illness. If the assumption is true, we have a classic
 problem of data with random noise.  Its effect (Rummel 1965) will serve
 only to reduce the "real" correlation.  In summary to our critics on this
 point, we can say that if they accept the assumption of randomness, then
 results cannot possibly be inflated and probably are reduced from what we
 would expect if we were able to control for that randomness. *°'

 Data Collection Procedures

       Elementary school attendance  data was collected for  school year
 1969-70  for all  communities  having at least one elementary school and
 having been studied in the 1969 CWSS study.

       Since both the 1968-69 and 1969-70 school year's overlap the calendar
 year 1969, we had the option of choosing either for our  data base.  Although
 the 1968-69 school year covers more months of 1969 than does the  1969-70
 school year, we chose the latter because  of the wide-spread Hong Kong flu
 epidemic from November 1968 through March of 1969.   We believe that
 this could have  introduced additional uncontrolled variance. At its peak,
 the epidemic was  afflicting 10 million people per week.  ''

       In the collection .of our data, we found that central records are
 normally available in the state capitols or county seats.  We were courteously
38) Although the correlations will be low,  the regression coefficients will
    still be unbiased.
39) Wilder, Charles S. , "Acute Conditions Incidence and Associated
    Disability. "  U. S. Department of Health, Education and Welfare,  Public
    Health Service, 1972.
                                    29

-------
  given access to these records by state and county officials. In nearly
  every state at least a  small number of individual schools had to  be
  contacted because the state did not have their communities recorded
  separately. Where central records were available only the smallest
  of the communities had to be  contacted separately.  In five states
  (Kansas, Missouri, Louisiana and California) attendance records were
  not  kept in a central location  and all schools had to be contacted
  individually.

  Water Quality  Data

        We used two direct measures of drinking water
  quality and 18 indirect measures covering various aspects of water
  treatment facilities and their direct operators.  These data, shown in
  Table 3  below were  taken from the Community Water Supply  Study
  (CWSS) performed in 1969 by the Public Health Service (PHS). 40)

        Unfortunately,  the  CWSS did not take adequate precautions to
  insure against the possibility of stochastic variation in drinking
  water quality.  The major portion of the data, in fact, was collected
  at one point in time.  This would be adequate if drinking -water quality
  was  relatively stable across time.  However, it is a widely held
  belief that for many reasons,  the quality  of U.S. drinking water  can
  vary considerably across time. Since we have only a one time
  measure of water quality for any particular tap,  it is  necessary to
  assume that water quality is  stable across time  if the measure is
  to be acceptable as a  linear function of an average acres s an
  entire year.   In most instances, this is a dangerous assumption.
  Therefore, the results which will be discussed in this report must
  be evaluated with cognizance for the fact  that the water quality data
  were not collected in  a rigorous time series.
40) McCabe, op. cit.
                                   30

-------
                              Table 3   '

Total Coliform
Total Plate Count
Range of least and most monthly samples
Is the lab certified ?
Frequency of sampling of finished water
Common walls between finished and lesser quality water?
Are there inter-connections to other systems, of known acceptable quality?
Of unknown quality without protection?
Detectable chlorine residual in distinct parts of the distribution system?
Chlorination process interrupted XX times in last 12 months
Highest level of formal education
Length of time on this  job
Total years water purification experience
Is operator full time employee?
Salary range
Level of training in water treatment
Level of study in microbiology
Level of study in water chemistry
  41)  Heavy metals and toxics, etc. were not included in the study.
      The reasons are that 1) the presence  of heavy metals  is so in-
      frequent as to make statistical analysis unreliable; and 2) the
      effects of heavy metals are not likely to show up in school
      attendance data.
                                    31

-------
       Fortunately,  the CWSS did provide us with a number of potential
 surrogate measures which make intuitive sense, and which we also found
 to   bear some interesting relationships to school absenteeism.   These
 measures are the indicators of plant facilities,  procedures and of the
 operators' capabilities.   Logically we expected that poor quality  operators
 and/or facilities would tend to produce poor quality water. Although we
 have no data which specifically enable us to compare the surrogate with
 the actual variable, ahe linkage seems to'be  a  reasonable one. Additionally,
 the surrogates (salary of operator,education of  operator) are stable across
 time and have some significant theoretical linkages to drinking water  quality
 which will be  discussed  below.

 Air Quality Data

       The air quality data was obtained from EPA's National Aerometric
 Data Bank: Yearly  Frequency Distribution. Although SO_ and particulates
 are both important measures,  only the latter was  sufficiently available to
 permit its usage.

       Unfortunately, air quality monitoring stations are not necessarily
 located in the same communities which were  studied in the CWSS study.
 We, therefore,  had to attempt to estimate the quality of the air in the
 CWSS communitiesi by using data for the quality of air in nearby communities,

       The estimations were made by geographically locating the  set of CWSS
 communities being examined in this stxidy and the most proximate air
 monitoring  stations.  Our first rule was that we assume  that the  air quality
 is equal to the air quality of the closest monitoring station within 15 miles.
 If within 15 miles,  two or more stations were approximately equidistant from
 the CWSS community, the one  which came  closest  to being upwind, given
 the prevailing winds, was selected.

42) We cannot compare water quality with the  surrogates  because we do not
    trust the direct  measure  of water quality.  Clearly, if we had direct
    measures of water quality which we believed, the surrogates  would  not
    be necessary.
                                     .32

-------
      In many instances no air quality stations existed within 15 miles.
Indeed,  in some areas no measures were to be found within a 50 mile
radius.  This posed a dilemma because,on one hand, we wanted to retain
the option of using all the data -even if questionable - but we also  wanted
to have the option of rejecting those air quality data associated with stations
more than fifteen miles from the city in question.  Therefore, we  created
a dummy variable whose value is  1. 0 for cities with air data gathered
within 15 miles and  -0. 0 for those whose data were gathered at a  distance
greater than 15 miles.  This  gave  us the option of treating the air  quality
data as "missing" if it -were  gathered at a distance of over 15 miles.
Socio-Economic Data
      The socio-economic characteristics of the communities were obtained
from the 1970 census. We used the tapes containing the first count data.
From the list of variables available on the first count, we selected the
following subset:

            1  Total population
            2  Percent rural
            3  Percent SMSA
            4  Percent urban portion of SMSA
            5  Percent rural less than 2500
            6  Percent in urbanized areas
            7  Percent less than 18 years old
            8  Percent 18 to 62 years old
            9  Percent 62 and older
           10  Percent white
           11  Percent black
           12  Percent of families with a husband  and wife
           13  Percent other male head of household
           14  Percent other female head of household
           15  Average home value
           16  Average rent
           11  Percent units less than  1.0 persons/room
           18  Percent units 1.01 - 1. 50 persons/room
           19  Percent units greater than 1. 50 persons/room
           20  Percent units with complete kitchen
           21  Percent units with complete plumbing
           22  Percent without complete plumbing
           23  Percent units with shared toilet
           24 Percent persons with complete plumbing
           25  Percent persons without complete plximbing
           26  Percent families with complete plumbing
                                    33

-------
      We believe  that this subset of community characteristics are the
most likely to have some influence on health.  The 26 variables are not
all mutually independent. In fact, they fall into six identifiable categories
as follows:

           Variables                        Category

           1-6                               Urban/Rural
           7-11                              Age/Ethnic
           12-14                             Family Characteristics
           15-16                             Wealth
           17-19                             Density (persons/room)
           20-26                             Home Plumbing

      Of these six categories, we  expected only the last five to enter
directly into  regressions. Given the results of the CWSS study, we
expected that the urban/rural distinction would present particular problems
which would require a dichotomization of the sample rather than simply
an additional term in the regression equations.  The necessity to dichoto-
mize the sample will be discussed with the other hypotheses in the following
section.

      It should be noted that for each of the remaining five sets, it is
only reasonable to use one of the possible variables at a time  because they
are either  statistically or sometimes linearly related to each  other. For
example,  since variables 12 to 14 must eqxial 100%,  any two of necessity
define the third.  Furthermore, the correlations between any two are
expected to be so high as to prohibit the use of more than one  because of
the problem of multi-collinearity. We chose to include each of the variables
in the data set for pragmatic purposes. There is no research  or theory which
would give us good reason to choose one version of the same variable  over
another. Rather  than arbitrarily exclude the data,  we chose to include each
version in our initial analysis.   We employed a modified stepwise regression
by limiting our final variable set to one variable per category.  We felt that
this relatively mild limitation (one variable per category) made the  inter-
pretation more manageable as well as alleviating  somewhat a  troublesome
degree of freedom problem.

Climate

      We included the climatic measures of average hours of  sunshine, the
ratio of temperature/humidity, and inches of precipitation to attempt to
estimate the  effect of climate on health. These climatic variables were
                                   34

-------
included because it is widely recognized that weather does influence
health.  The fact that sub-clinical illness is the greatest during winter
months is virtually unquestioned.  Additionally, it is normally believed
that the effect is most severe in the colder and more humid areas.  It
should be recognized, however,  that in this study we had no time series
absentee data to pick up seasonal illness variations and we had only nine
geographical regions thus giving us only nine data points for our climate
variables.   Because of this, our expectations were not particularly  high that
we would find that sunshine or the temperature/humidity index would
produce interesting results.

Hypotheses

      If health is to be statistically linked to water quality in a natural
experimental setting,  it is necessary to us to recognize that,  in un-
controlled situations,  people become  ill for a wide variety  of reasons.
Some of these may have to do with specific illness-producing substances
such as air pollution.   Others may have to do with inadequate  diets or
poor health care.  Still others may result from living in overcrowded
conditions or sub-standard housing.  To the extent that these factors
systematically influence health,  we attempted to account for them in this
study.  At the most general level, then, we are hypothesizing that,
accounting for all other macro-systemic influences on health, the
occurrence of sub-clinical illnesses will be the highest in those communities
with the worst water  quality. More specifically,  we are arguing that these
factors account for absenteeism roughly according to   Figure 1 (next page).

      We are arguing  that the elements in the boxes act directly on
elementary school absenteeism. The  elements in the diamonds act on
absenteeism through the elements in the boxes.  All of these are encompassed
in the broader circle which includes  those factors which have  complex
and wide ranging effects on all of the  factors  and/or their relationships
with absenteeism.  Because these factors act in concert,  we cannot simply
pull out water quality and analyze  it by itself. Although the other variables
a-re "just" control variables, they are vital to the successful analysis of
the absentee data.  Let us consider the control variables  first.

      Health care is one  of the most obvious links to absenteeism.  Clearly we
would  expect that improved nutrition,  housing, medical  attention,  would
directly influence health.  Assuming that health is linked to absenteeism,
we  can presume that there is a close  linkage  between  health care and
absenteeism.

      Similarly, cultural factors  can be argued to  link  directly Ic  absenteeism.
The ethics of social or cultural groups in all probability influence the
motivation to attend school. This may, on one hand, result in simple
unexcused absence.   On the other hand,  some cultural/economic groups

                                   35

-------
          Figure 1
                    Socio-Economic
                        Status
                                           Social
                                           Groups
                                            Cultural
                                            Norms
Elementary
School
Absenteeism
Air
Pollution
                       Drinking
                       Water
                       Quality
     Regional Differences and
     Urban/Rural Differences
              36

-------
may provide students with encouragement to attend school  even when
they are ill.

      Both health care and cultural factors, we argue,  are related to
socio-economic status.  Normally we expect most of the factors associated
with health care to be positively related to socio-economic status (SES).
For example, it is common knowledge that the children of low income
parents suffer greater physical and mental problems from poor nutrition
than do children of more affluent parents. We might also expect that
there is some SES  level above which there is no general improvement in
health care.  We also expect that the protestant ethic of the upper and
middle classes would have an effect on those cultural norms partially
influencing school attendance.

      In addition to the  economic determinants of  cultural norms, it is
possible that other cultural descriptors are salient. For example, many
people argue that those communities with high percentages of blacks
have different behaviors toward education above and beyond the fact that
they are normally in lower economic groups.  As a point of interest,  it
should be noted that there  was no empirical  confirmation  of this position.

      Air pollution and climate are two of the other factors presumed  to
directly cause  health problems and, therefore, school  absenteeism.  The
exact details of the relationship between air  pollution and health are
somewhat inconsistent across  the numerous  studies.
Nonetheless, there seems little question that air pollution is related to
health in some manner.  The relationship between climate and absenteeism
is two-fold.  On the one hand,  as we discussed above,  climatic variation
influences health.  Additionally, adverse weather can close schools,
temporarily, obviously increasing absenteeism.

      All of these relationships  mentioned above  are themselves
partially and  complexly influenced by regional variations and by urban/
rural differences.  As a simple example, adverse weather often has a
more  serious effect on the closing of rural  schools than on urban schools
because of the relatively large miles  of highway per person requiring plowing.

      Popular wisdom argues  that the protestant ethic  in rural America
when coupled with  the allegedly healthier life style should reduce
elementary school absenteeism.  On the other hand,  the frequently lower
incomes of rural communities,  the more limited  access to medical
facilities and other factors conceivably could counteract this effect. All
                                    37

-------
of this is further complicated by the fact that we know for a fact that
water  treatment facilities in rural areas are inferior  to those of
urban facilities.  '

      All of these factors basically lead to the fact that these broad
complex factors (urban/rural and regional differences) must be
handled carefully in the analysis  of the data.

      Having discussed the control variables, let us  now move to the
independent variable  of  interest - drinking water quality.  We  discussed
in the literature review the arguments and analysis supporting the
hypothesis that poor quality drinking water can cause illness.  That this
should logically be the case is not the least bit surprising. What is
somewhat surprising is the fact that the water provided through public
systems may on a widespread basis,  be of low enough quality  to produce
illness . Given the results of the CWSS  and other studies, we have good
reason to believe that people do in fact become ill from low quality
drinking water. As we see in figure 1«  variations in  drinking water
quality are  hypothesized  to be influenced by variations  in either the quality
of the operators or the quality of the facilities or both.

                          44)
       It is widely accepted  'that the reductions in quality  resulting from
human error tend to be sporadic  rather than steady state.  Workers ranging
from assembly line operators to  air traffic controllers tend to err by
making occasional mistakes.  They tend not  to perform in a constant
sub-standard manner. The poorer the worker the more frequent and more
severe are the mistakes. Thus,  if we were to directly measure the quality
of the water produced by poor operators, we would have to have  a time
series recording the frequency and magnitude of the  deviations from the
average.  Because we do not have data of that quality, we must use measures
of operator qualifications as surrogates for water quality.

       The CWSS reports that water quality varies regionally and by level of
43) McCabe, Op. cit.


44) See for example: 1) Niece, E. H. ,  "Statistical Quality Control and
    Value Analysis, " in Carl Heyel (cd. ) The Foreman's Handbook,
    McGraw Hill,  New York,  19&7; 2) Levinson, Harry, Executive
    Stress,  Harper and Row, 1970; and 3) Barnes, Ralph, Motion and
    Time Study: Design and Measurement of Work,  Wiley,  New York,
    1964.
                                   38

-------
of urbanization. Urban regions tend to have both higher levels of water
quality, better trained operators and more sophisticated facilities. Since
we suspect that school absenteeism, water quality and the  control variables
all are influenced by the urban/rural and regional variation, it is apparent
from the outset that a generalized model applying to all cases may encounter
difficulties.  Because of this, we have  to recognize  the possibility that while
it may be possible to develop a general model of the effect of poor quality
drinking water, it is likely that we will also  need alternative models  to deal
•with the problems of multicollinearity  and systematic bias in both the
dependent and independent variables.

      In addition to multiple models, we will also use dummy variables
which themselves include some of the  correlated interactions we are trying
to simplify.  In this  study,  we added dummy variables representing each
of the SMSA's being studied. For  example, we had one variable each for
Vermont, New York,  Cincinnati, etc.   If a community were in the region,  it
received  a 1.0, otherwise  it was  scored a 0.0.  This approach enables us
to try to include unaccounted systematic regional differences in elementary
school attendance. It is important to note that this method is only marginally
effective in controlling for  any regional bias in  the independent variable set.

      The primary hypothesis is  that one or  more versions of the following
equation should predict absenteeism.

A..= C.-M    W.+ 4. ,.  E.+ A.- F.-f 4  I. + j    D.+ ((,. S.+ / _. P.-f j^.CL +e.
  ij   J   lj   i   2j   i ^3j  i   Mj  i   \5j  i  %j  i A 7j  i  ^8j   r   i

where
      Subscript indicates a water  supply within  a subset area j

      A = elementary school absenteeism
      C = constant
      W = water quality measured either by  plate count of  coliforms
          or by quality of facilities
      E =; one variable from the age/ethnic category
      F = one variable from the family characteristics category
      I = one variable from the wealth category
      D = one variable from the density category
      S = one variable from the home  sanitation category
      P = level of air pollution
      Cl = climate
      e = the error term
                                    39

-------
       Frequently, hypotheses such as these are considered proven or
 disproven simply on the basis of the statistical confidence of the partial
 correlation or of the regression coefficient. In this analysis,  it should
 be emphasized that  we have some  definite expectations about the direction
 of the relationships.  These linkages are most important for the pollution
 variables  and particularly for water quality.  We would expect both water
 quality and air quality to be negatively related to  absenteeism. Except for
 some narrow ranges at very high levels of purity, it is  not reasonable
 that higher quality water or air  should result in increased illness.   '

       For the other variables, we would expect younger  populations to be
 healthier; non-white population to  be less healthy; wealth should be nega-
 tively related to absenteeism; density should be positively correlated ; and
 home  sanitation should be negatively linked with health.  Finally, given
 the pressures of time and, frequently, income  on  single  parent families,we
 would expect absenteeism to be  higher in single parent families than in
 the more traditional husband-wife family.

        The socio-economic measures would be used in the analysis in an
  additive manne'r, but only one variable per group was permitted in the
  final regression. These were selected using standard stepwise regression.

  Analytical Methodology

        The primary  statistical tool which we employed in the study  is step-
  wise multiple regression. This is a technique which fits well with  the
  general assumptions  that health is a result of the simultaneous effects of
  multiple factors. The methodology provides us with a set of coefficients
  which indicate the expected magnitude change in  the  dependent variable  as
  a result of a given  change in one of the independent variables.
45)  At extremely high levels of purity, small increases in bacteria levels
     may serve as boosters to keep anti-body levels high enough to ward off
     more serious infectious material.  This fact is particularly important
     for those persons  who are a part of a stable population.
                                     40

-------
      In the current study,  we are primarily concerned with the health
effects  of water quality and have taken considerable effort to eliminate any
serious problems of multi-collinearity. Since we are less concerned with
the individual (as opposed to combined) effects of the control variables, we
placed less effort on maintaining statistical independence within this  group
of variables. We did,  of course,  take the effort to insure against linear
independence of the control variables.

Missing Data

      In nearly any statistical analysis using surveys or multiple data files,
missing data becomes a problem.  There are three logical approaches which
can be used to solve the problem only two  of which are worth serious con-
sideration.  The one logically feasible but impractical solution is to ignore
the problem and run the analyses as though the data were present. This
alternative inserts zeroes in the missing cells and thereby gives the
missing cells arbitrarily incorrect values.

      The second alternative is to simply omit any case from the analysis
for which there are one or  more missing data cells.  This alternative
has two negative  effects •which under  some circumstances are not
particularly difficult to accept. The first is that the sample size is reduced
thus reducing the degrees of freedom available for analysis. The second
is that if missing cells are systematically distributed, the actual
characteristics of the  sample are altered by eliminating a non-random
subset of  cases

      The third alternative is to estimate the missing cells  given informa-
tion already in the  data set. For  example, a data set in which age, income,
race, education,  and sex were independent variables, a missing "income"
cell could be statistically predicted using the remaining four variables.  The
method is used most frequently in studies  having a rather numerous  random
distribution of missing cells. For example,  if each of five variables  in a
data set had ten percent randomly distributed missing data there would be
. 5N empty cells. This logically  could eliminate  as many as one half  of the
cases.

      The difficulty of estimating missing cells is that to do it reasonably
successfully creates a conflict of the assumptions of the various techniques.
Multiple regressions,  as we pointed out above,  assumes the independent
variables to be independent of each other.  The missing data routines, on
the other  hand, assume that the  more highly intercorrelated the independent
variables are the more accurate  the estimates are. Thus,  the closer one
moves toward the ideal assumption of regression, the less accurate  the
                                  41

-------
missing data estimates become.  There is,  admittedly,  a gray area in which
some researchers believe that the technique is preferable to eliminating large
numbers of cases .

      In the  current analysis,  there are two significant types of missing data.
The first of  these are the census variables which, if any are missing, are
generally  all missing because we were unable to find the community on the
master enumeration district list. Normally  this occurs in very small un-
incorporated communities which have their own elementary school.  '

      The second primary source of missing data is in the water quality/
facility inventory variables. In some instances, the water quality  measures
were not present.  For our sample this was relatively infrequent. More
frequently,  some of the questions concerning characteristics of the operators
or failures went unanswered. These missing cells tended to be scattered
although there was  a slight tendency for some respondents to refuse to
answer several questions.

       Given the above configuration of missing data and given that the water
 quality/facility inventory variables were of primary significance (and,
 therefore,  should not be  estimated) we chose to eliminate those cases
 which  had any missing data on the particular variables being analyzed at a
 specific time. Clearly this would change the sample size as a function of
 the particular variables being employed in the analysis.  For example,
 consider  the following hypothetical data set:

                           Var.  1   Var.  2   Var.  3     Var. 4
       Case 1                 1        1          0        25
       Case 2                 2        5          0        19
       Case 3                 313
       Case 42                   3
       Case 5                 3        1          0        10
       Case 6                 18          60
       Case 7                 1        6         12         8
       Case 83                            32
       Case 9                 115
       Case 10                12          84
46) If there were no elementary school,  there would be no dependent viiriablc
    and the case would not even be included in the data set.
                                   42

-------
      If variable  1 were the dependent variable and variables 2, 3, and
4 were independent variables, cases 1,  2,  5,  6, 7,  10 would be examined
because they have complete data on all three variables.  If,  on the other
hand, variables 2 and 3 were the independent variables, cases 3 and 9 would
be added to the above list because  there is data for them for variables 2
and 3 but  not 4.

      In most circumstances,  missing data tended to occur in the smallest
communities and those communities with at least one  piece of missing
data tended to have several missing pieces. The deletion of cases with
missing data, therefore, biased the sample in the direction of larger
communities relative to the CWSS sample.  There is , of course, no
assurance that the CWSS sample was itself randomly selected from any
larger population.

Case Selection and Aggregation

      The  CWSS,  as  mentioned before,  covered 969 public water systems
across the United States. We were unable  to individually examine some of these
for three basic reasons:
      1) Two or more public systems supplied one community.
      2) A public system served a community which did not have an
         elementary school.
      3) The system was in California and the school's records were
         not easily accessible.

      If two or more water systems  supplied a single community, a weighted
average water quality figure was computed using the number of people served
as the weighting figure. For example, consider the  following typical hypo-
thetical situation of three systems supplying one medium sized community.
                          Population Served    Total Plate Count
      System 1
      System 2
      System 3
      Community means =
 2000
 4000
10000
16000
32 x 10<
                               16 x 10
  3000
  4000
  1000
  8000

2000
 P x C

 6 x 10J
16 x 10
10 x 10
32 x 10
                                   43

-------
    A large number of the systems examined in the CWSS provided water
to very small communities,  (under 500 population)  or to special populations
such as hospitals, trailer parks,  camping grounds,  boarding schools,
nursing homes, etc.  Of these, all special systems and systems  serving
those smallest communities with no elementary schools were dropped from
the  study.

    The State of California was one of the states which does not keep
centralized absenteeism records.  Therefore, individual schools were
contacted to collect the absenteeism data for the school year  1969-70.
Because absenteeism in California means an unexcused absence, we
did  not believe that extensive collection efforts were warranted.  However,
every school district serviced by one  of the water systems measured by
the  CWSS was contacted. If the data for a given school were  not easily
accessible, we did not  request the school's  officials to extend themselves
to provide it.
                                  44

-------
The Test

A General Model

      Despite all of the potential pitfalls of a general model, we found the
results encouraging.  The bivariate correlations showed the
following variables to be most strongly linked to absenteeism.


      Variable                 Correlation

      Precipitation               . 40
      Vermont  Dummy          -.36
      Temperature               . 34
      New York Dummy          . 30
      Urbanization               . 26
      Frequency of Testing
        of Finished Water        -. 24
      Louisiana Dummy          . 20
      Operator Salary            . 19

      Particular attention  should be paid to the incorrect signs for the
relationship between temperature, operator salary and absenteeism. We
shall find shortly, that these problems disappear when we control for
urbanization and regional variation.  It is  also worth identifying those
variables which did not appear in the list of significant bivariate re-
lationships.

      Variable                 Correlation
      Total Plate Count           . 09
      Home Value                . 04
      Living Density              . 02
      Shared Bathroom           . 02
      Presence of common
        walls with inferior
        water                     . 01
      Presence of Chlorine
        Resident                  .03
      Air Quality (particu-
        lates)                     .07

-------
 In the general  model, none of these variables ever  showed significance
 despite controls for other variations.   '

       After running the bivariate correlations,  we examined the data using
 stepwise regressions. Because of the  complex  intercorrelations in the
 data, we ran the regression several times using different variables. The
 overall findings from the regressions of all cases is that controlling for
 urbanization and regional variation,' the only consistently significant
 variable is operator salary.   The  frequency of testing finished water is
 consistently   related in the proper direction although the  relationships are
 weak.  The regression coefficients  are  in the correct directions in all
 cases.  Operator salary  is generally significant at the . 10 level.  In some
 instances, however, its  significance drops to . 20  .  As we mentioned, the
 frequency  of testing is a  weaker variable and is,  at best, significant at the
 . ZO level.   Interestingly, the product of operator quality and frequency of
 testing  is significant at the . 05 level in the following regression equation.

       Multiple R = . 33    Significance  of the Regression = . 001  R-Squared = . 10

       Variable     Partial       Coefficient  Std. Error  T-Stat.  Significance
                     Correlation

       Constant                       5.7        .57       10.0        .001
       Percent in
         Urbanized
         Area           .09           .8        .7          1.2        .25
       New York
         Dummy        .20          1.5        .6         2.7        .01
       Louisiana
         Dummy        .26          3.4       1.0          3.5        .001
       Fqx  Op48       -.15         -.08        .04      -2.1        .05
47) All of thein were significant in at least one regional analysis.

48) An index comprised of frequency of testing finished water times
    operator salary.
                                    46

-------
      We would argue frequency of testing times the  salary of the operalor(s)
is a powerful index of the operation of a water treatment facility. Whereas, a
weakness in one or the other variable may not necessarily mean poor
operation, infrequent testing coupled with an underpaid operator almost
certainly signifies substandard operation.

      Another interesting point is that although temperature and precipitation
showed significant correlations in the bivariate analysis, in no instance
are they even close to  significant in the  regressions.  Clearly, whatever
variance climate was able to explain at  the bivariate  level was better
accounted for by the  regional dummy variables in the regressions.

      The general model clearly includes a great deal of uncontrolled or
semi-controlled variance. The correlations are  not particularly strong
and a multitude of sins are buried in the regional dummy variables.  None-
theless it is very encouraging to note that variables linked to operation of
water treatment facilities are consistently stronger than any of the other
competing specific variables such as home value  (a surrogate for income),
density, cultural factors, etc.

Regional Analyses

      In an  attempt to gain additional insight into the relationships between
drinking water and health, we subdivided the sample into more homogeneous
sets.   In' choosing these subsets it was nec.essary for us to balance the
homogeneity of the subset against the number of cases in the  subset.  On the
one hand, we wanted to have as great homogeneity as possible while also
retaining enough cases to permit meaningful analyses. In doing this we
finally decided upon four subsets two of which  are quite homogeneous, the
other two of which are  less so by reason of necessity.

      The homogenous  subsets are Vermont and the  greater New York  area
excluding New York City.   The former includes all of the analyzable CWSS
communities in Vermont.  The data cover nearly every county in Vermont
a-nd systematically exclude only the rural-farm population.  We had 82
communities with complete data.

      The New York data  cover Westchester,  Nassau, Suffolk and  Rockland
counties. The communities studied are  generally suburban  and  smaller
urban  centers.  New York City was  excluded because it has an extremely
serious unexcused absence problem.  After examining the New York City
data rather  closely, we concluded that uncontrollable variations in unexcused
absence made the absentee data hardly meaningful as an indicator  of
illness.  Excluding the New York City data we had 54 cases  with  complete
data.
                                  47

-------
      For the rest of the areas covered by CWSS,  we did not have sufficient
cases to permit truly regional subsetting. Therefore,  we left the areas
together and subset them only by level of urbanization.  Urban and  sub-
urban communities •were placed in one category.   Rural communities as
classified by the Bureau of the Census, were placed in the other subset.
We had 62  cases with complete data in the urban/suburban subset and 47
cases in the rural subset.

      Summarizing, we had four subsets which we subjected to further
analysis. They are listed below.
Name
(1) Vermont
(2) New York
Cases with
Complete Data

      82
      54
(3) Six Area Rural
      47
(4) Six Area  Urban
      62
Comments
Includes data from most Vermont
counties.  Generally  small
communitie s

Four counties surrounding New York.
Includes small communities and
suburbs with a few medium sized
cities.

Rural communities surrounding
Cincinnati, Ohio,  Charleston,  W. Va. ,
Charleston, S. C. , Kansas City, Mo. ,
New Orleans, La. , Pueblo, Colorado

Urban and suburban  areas including and
surrounding six areas in  (3)
      The results of the final regressions for the four areas are shown below.
The interpretation and discussions will follow the presentation of the
equations.
Vermont
      N = 82
      Significance of regression = . 97
      Multiple R = . 508  R-Squared = . 259

      Absenteeism = 5. 04  -  .568  -.33Ic +16.48D
                                  48

-------
                                                Standard Error of
                              Significance       Coefficient
S   '  =   Salary of operators
         (index)                    .99                   .14

  50)  = Interconnections with
Ic
         poor quality systems      .85                   .21


D =      Percentage of living units
         with more than  1. 01
         persons/room             .90                  8. 8

New  York51)

      N = 54  Significance  of Regression - . 99

      Multiple R = . 614  R-Squared = . 377

      Absenteeism = 7. 02  - 11.4 Env + 34. 25D0 - . 000033H - . 53R
                                                                 u
                                                 Standard Error of
                              Significance        Coefficient
      Env = Environment
             Index                  .98                   4.4

      D  = Percentage of
            living units "with
            greater than 1. 5
            persons/room          095                   16.2

      H =  Home value              .99                     .00001

      R  = Percent rural          .85                     .34
        u
49)   1 = 1999, 2 =2000-4999,  3 = 5000-7499, 4 = 7500-9999, 6 = 10000 or
50)   1 = Y, 2 =N
51)   Computed by dividing index of operator salary by average particulates.
      The result is that a high value indicates high environmental quality.
                                  49

-------
Six Area Grouping - Rural

      In the above mentioned six areas, the CWSS studied water systems
in 57 rural communities under 2500 persons which also had an elementary
school.  Of these,  10 had missing data on at least one of the variables
which were finally entered in the  regression.  The regression results are
summarized below.

      N = 47    Significance of Regression = . 99

      Multiple R = .501 R-Squared = . 251

      Absenteeism = 13.9 - 1.04F  - 1. OS
      p52) -. Frequency of testing
          of finished water

      S  ' = Salary of operator (Index)
Significance

      .99

      .90
Standard Error
of Coefficient


      . 32

      .54
      In this subset we tried to employ the index of frequency of testing
times salary of the operator.  The product was significant at the .99
level but the total explanatory power of the regression was smaller than
when we employed the additive model. Because the additive model is
easier to extrapolate and because statistical confidence for the  additive
model was quite satisfactory we remained with that  solution.
52)  1  = each month, 2 ~ each year,  3 =2 years
     4=3 years, 5 = infrequent,  6 = never

53)  1  =  < 1999,   2 = 2000-4999,  3 = 5000-7499
     4  =  7500-999, 5 = 10000 or
                                  50

-------
Six Area Grouping - Urban /Suburban

      N = 62   Significance of Regression = . 99

      Multiple R = . 79  R-Squared = . 62

      Absenteeism = 3. 81  + . 0006C +  . 69R + 54. 6T + 1. 4K

                                               Significance

      C = Total plate count                        . 99

      R = Presence of Chlorine residual            . 95

      T = Percentage of living units sharing
          bathroom facilities                      . 99

      K = Kansas City Area Dummy                . 90
      In examining the results there are two facts which are initially
apparent.  The first is that  the strengths of the relationships are con-
siderably  stronger for the subsets than for the general model.  Multiple
R's range from a low of . 501 in the six area-rural to . 79 in the six
area-urban/suburban, as compared to multiple R's in  the . 30-. 35 range
for the general model.  The significance  of the water-related variables
is also appreciably higher.

      The second major point to observe  is the fact that  in all regressions
except one,  operator  salary is a significant predictor  of absenteeism.  In
Vermont and the six area-rural areas the relationship is additive with  one
other measure of plant operation.  In New York,  operator salary interacts
multiplicatively with air quality.   It would appear that where there is
appreciable  air pollution, it lowers resistance to disease such that contaminants
in the water will more easily produce  illness.

      Although operator salary is a strong predictor of elementary school
absenteeism in the national sample and three of the subsets, it is not linked
to absenteeism in  the six  area-urban/suburban grouping. Here we find
chlorine residual and total plate count as the primary  factors.  Because it
is only in this isolated sub sample that these variables  appear,  we have to
be somewhat skeptical of  the confidence we can place in  them.  Caution is
particularly important when we recall that the direct measure of water quality
•was taken  only at one point of time. Thus, the reliability of the independent
                                  51

-------
variable in this regression equation is in question.  Keeping these
important caveats in mind, the findings do make sense if we were to
hypothesize that for some reason, the contaminants in the water in
this subset arose from systematic non-human related problems.  For
example, if there were some serious problems with the distribution
systems, we could argue that the one time measure of total plate  count
would, in fact,  identify the presence of generally contaminated water.
Similarly, any facility problems  such as these would be more likely to
produce  a steady level of contaminants than would operator malfunctions.
If for some reason, the  six area  urban/suburban systematically has
different problems than all other areas,  the  results have meaning. Again,
however, these results must be taken with caution.

      Across the total sets of results, we find that urbanization and geo-
graphical location appear to play predominant roles. These,  in fact,
prompted subsetting in order to control for their  associated complex
variances. Aside from these factors there is the  unmistakable presence of
something we might call  housing  quality.  One or more of the variables
measuring home value,  occupants/room, and the frequency of shared
bathrooms occur in three of the four subsets. The importance of these
variables is mostprominent in the Vermont and  New York  subsets.  As
the models covered broader geographic ranges  (i. e. ,  six  area groupings
and the general model) the  effect of housing variables tended to wash out.
This is not particularly surprising given the  geographic differences in
property and housing values and types. ^

      Notably,  none of the  climatic variables enter into any of the equations.
It should be noted that weather  variables play a strange role in studies which
are not time series oriented.  Although from one month to the next,  weather
variation is known to influence  absenteeism, this variation can be seriously
obscured by high level aggregation.  This is particularly important when the
communiti*?*? being studied  are  geographically proximate  to each other.
For example, we would not expect the weather in Burlington,  Vt. to be
much different from Rutland Vt.  We know,  however,  that absentee rates
do differ between these two communities. Weather may theoretically
explain a good part of the  stable  base of absenteeism but  it cannot reason-
ably be expected to explain the  differences between two geographically proxi-
mate communities.
      Even in the six area groupings, we have only 3 or 4  appreciably different
climates. Here, one  results show that the influence of  climate is subsumed
in the geographical dummy variables. These latter variables apparently
incorporate more than climate  and are, therefore, the stronger predictors.
54) For example,  it is difficult to assume that differences in home
    values between Mt. Kisco, New York arid Abita Springs,  Louisiana
    reflect differences in standard of living.
                                   52

-------
       In concluding the interpretation,  we would point out that in both the
 general model and in three of the subsets,  operator salary shows a strong
 and consistent relationship to elementary school absenteeism. There  are
 some differences from region to region but they basically support each
 other. The one significant  deviation occurred in the six area urban/
 suburban grouping in which operator salary did not enter at all but was
 replaced by chlorine residual and total plate  count.  Given the data
 collection difficulties,  one would have to intrepret this one deviation with
 considerable  caution.  If the results of the six area urban/suburban
 grouping are meaningful, they must be taken to indicate  some non-human
 problem which
 wate r.

       As support for the major findings concerning operator salary, we
 should recall that the CWSS reported that 84  percent of the principle
 operators were earning less than $7500 per year and that 37 percent
 were  earning less than $2000.  With such poor salary structures for
 operators, it is not  surprising to find this as a significant element in the
 production of poor quality water.

       The fact is quite  significant. While the regression equations broadly
 predict an improvement for all regions studied of . 2 to .5 percent in attend-
 ance for each $1000 increase in operator salary,  we should not assume  that
 this means that simply by paying the operators more money, we will improve
 performance by that amount.  While there may be some improvement in perform-
 ance with increased salary, the problem is obviously much deeper than that.

       Given 1969 general salaries in the areas we were examining,    the
 majority of operators were in the lower 50%  income range.  This simply
 makes recruitment of individuals with college training in-chemistry,
 microbiology aid related subjects quite  difficult.   In fact,  the CWSS showed
 that over 60% of the  operators never had even a short course in water treat-
 ment.  To  a larger extent, the public's  image of municipal water treatment
 operators is often much closer  to a manual laborer than  to a professional
 with a significant role in  community health. According to some people with
 whom we discussed the  problem, the primary role of the operator is to keep
 the machinery working  properly. While this is an important function
 performed in a water treatment facility, it hardly  seems reasonable  that
 that it be the primary role of the principle operator.

       Somewhat illustrative of the type of problem encountered is the
 following anecdote reported by Harris and Breckner : 56
55)  The averages fluctuated from area to area between $6000 and $8000
    per year per worker,

56)  Harris,  Robert and Edward Breckner,  Consumer Reports,  June 1974,
    p.  436-443.
                                   53

-------
                 The operator in charge of one New England
                 community water supply,  for example, heard
                 somewhere that activiated carbon will remove
                 foul tastes and odors.  So, when the water
                 smells bad, he fills a cloth bag with activated
                 carbon powder, ties a rope to it, climbs into a
                 rowboat, and tows the bag back and forth across
                 the town reservoir. This  is about as  effective
                 as waving a wand over the reservoir.

      Obviously to simply increase salaries would not be enough.  There
would have to be a program to upgrade the operator quality through training
programs and,  in the long range,  improved hiring.

      Another important water-related variable is the frequency of testing
of finished water This is significant at the . 99 level  in the rural areas
not including New York and Vermont.   In the general  model it consistently
appears with the proper sign although the statistical significance is erratic
across different regressions. This finding is also consistent with a major
deficiency uncovered by the CWSS.  In their report, they discussed the fact that
90 percent of the systems studied failed to meet the bacteriological sur-
veillance criteria.

      Finally, in Vermont there is a weak relation ship between absenteeism
and systems with interconnections with water systems of unknown quality
without protection.  This problem  is relatively rare showing up in  only six
percent of the total  systems examined.  Interestingly, thirty-five  of these
were located in Vermont.

Effects

      The effects of degraded drinking water can be expressed in a number
of ways.  The regression equations themselves are the most abstract form
of this effect. They essentially provide an estimate of the percentage change
in absenteeism» per unit change in the  independent water-related variable(s).
In the following section we will translate these into more concrete forms
moving progressively from  the soundest conclusions to the most tenuous.
We will initially calculate the magnitude reduction of elementary school
absentees in those counties  studied resulting from incremental improvements
in drinking water quality.  'The above findings will be expanded to the; dollar
57)Because of the tenuousness  of the six area urban findings, we will not
   extrapolate the costs for this group.

                                   54

-------
value of lost wages and medical bills  saved due to reduced absenteeism/
illness.  Finally, making   some very bold assumptions, we will extra-
polate the total dollar sayings to the entire country.

Elementary School Absenteeism - Vermont

      Because the CWSS covered nearly every county in Vermont,  we
believe  that the findings apply to all except the rural-farm population of
the state.  The study applies to a population of 409,000 and an elementary
school enrollment of 66, 600.  The regression coefficient of the variable
"salary of operators" is . 56,  and the coefficient for "interconnections
with unprotected systems" is . 33.   Because the salary variable was
coded in a variable interval manner,  we can strictly say only that  an
increase of one point in the salary index will  decrease  absenteeism by
. 56 percentage of the total student days. Assuming a 180 day school
year,  there are  12, 000 student days per year and each unit increase in
the salary index  would result in an  estimated  67, 100 fewer absences
from elementary schools.  The standard  error of the regression co-
efficient is . 14.  Thus, a reduced absenteeism of from 50, 000 to
84, 000 days is within the plus or minus one standard deviation unit.  A
range of 33,600  to 101, 000 is the range defined by plus or minus two
standard units.

      Since the salary index intervals are  roughly $2500 increments,  we
introduce only a  small error by translating these into actual salary values.
Doing this we would estimate that,  state wide, every $1000 increase in
the average  salary of the principle  operator would reduce total elementary
absenteeism by 27, 000 days per year.

      The  interconnections with unprotected systems have a regression
coefficient of . 33 indicating that for each ten  percent of the total systems
which eliminate  the problems, absenteeism rates will be reduced by . 033
percent of the total  school days. This translates to an  increase of 4000
student days per year. The standard error in the regression coefficient
is .21. The  comparable range of reduced absenteeism for one standard
deviation is  1500 to 5500 per ten per  cent increase in eliminating linkages
with unprotected systems.  The range for the  plus or minus two standard
units is 0 to 9000.

      If we are to draw any inferences about the  linkages between  drinking
water quality and adult absenteeism,  we must be able to make some
                                  55

-------
assumptions about the relationship between adult absenteeism and
elementary school absenteeism. Our first assumption is that the  portion
of elementary school absenteeism explained by drinking water variation
is a subset of acute illness.  That is, we are  not interested in other types
of absenteeism  such as chronic illness,  broken bones,  or other types of
absence which could not even logically be related to drinking  water.

      Obviously,  any discussion concerning increases  in operator salaries
implies that  it is the lower salaries  which are most in  need of improve-
ment.

      The  second assumption is that across a large population, the
occurrence of acute illness in adults  is a responsibly stable  proportion of its
occurrence in children.  This does not imply  a one-to-one relationship nor does
it imply that the constant of proportionality is the same for any specific
adult-child pair.
                                                                      A.
      Finally, we assume that this constant is measured by the ratio	
                                                                      (_>.
where A. is the  average, across the population being studied, of annual  l
adult days  lost from work with infective and parasitic diseases and
digestive system conditions.   C. is the counterpart for children's  days
lost from school. We have eliminated respiratory  conditions  from the
ratio, not  because it is impossible to communicate them via  drinking water,
but because the  others  are those most commonly associated with  drinking
water.

      The  National Health Survey of 1969 found that there were 100. 9 days of
work lost per 100 employed due to the  above  described acute conditions.
They found that there were 163. 8 days lost from school per 100 students
due to the  above conditions. A check  across  an SMSA vs non-SMSA
population  showed that  despite different rates  of bed disability with acute
illnesses,  A./C. ratios differ by a maximum of .006. Checks across
            i   i
different age groups show that in the  15-14 vs 45-64 age brackets, the
ratios are  identical. Given the other  error in the data,  we believe this
difference  to be insignificant.  Given that, we  will use the ratio      "  ~ . 616
                    *                                          163.8
as the constant  of proportionality.  This means that we  expect that the
percentage of estimated water-related absenteeism in the adult population is
. 6l6 times that  of children in elementary school.

      Additionally,  the survey found that,  excluding hospital  care, dental
bills, and  special expenses, persons •with no chronic illnesses spent an
average of $69 per year on doctors and medicine.  The  non-respiratory and
non-injury acute illnesses account for 30 percent of the total  disability
                                 56

-------
     CQ\
days.    Thus,  we estimate that the combined doctor and medicine expenses
per person per year for the acute illnesses which we are  considering is
roughly $20. 7.

      The  labor force in Vermont to which this study is applicable numbers
153, 094 with a total income of 873 million dollars  per year.  Each one
point increase in average operator  salary index should reduce absenteeism
by . 34 percent of the total worked days.  Annually for the  total state,  this
would mean a savings of $2, 980, 000 in wages or a savings of roughly $20
per working  person per year for each average increase of one point on the
operator salary index.  Using the rough  conversion factor of $2500 per
index point,  the savings are $1, 190, 000  or $8. 00 per worker for each
$1000 average increase in operator salary.

      In the area of interconnections with unprotected systems,  the savings
is considerably less.  For each ten percent average improvement in the
interconnections,  we can expect a reduction in  absenteeism of . 02 percent.
Statewide this amounts to a  savings of $170, 000 per  year  in wages or slightly
more than $1 per -worker.

      Total medical costs in the illness with which we are concerned are
$9.6 million dollars.

      Since each $1000 average salary increase reduces absenteeism by
.  136 per cent of the total worker days, this  accounts for   * "0* ^o  =  . 055
                                                         . 25
of the illnesses in the acute non-respiratory, non-injury category. This
amounts to a savings in medical costs of $545, 000 per year per $1000 average
salary increase.  Similar calculations show a savings of $77, 000 per year per
ten percent improvement in interconnections with unprotected systems.

      The  total  savings (salaries plus medical costs) are 1.7 million dollars
per year per $1000 average increase in operator salaries and $247,000 per
year per ten percent improvement in the interconnections.

"New York

      The  New York study examined non-farm  communities  and suburbs in
Nassau, Rockland, Suffolk and Westchcster  counties. We also examined
New York  City,but for  reasons explained above, had to leave it out of the
final analyses.  The applicable population in  the  above-mentioned com-
58)National Center for Health Statistics, "Personal Health Expenses -
   Per Capita Annual Expenses, United States: July-December 1972,"
   U. S. De partment of Health, Education and Welfare, Public Health Service,
   Series 10,  Number 27, February 1966.


                                 57

-------
munities number 3. 67 million persons and 645, 000 elementary school
students.  The final regression employed an index of environmental quality
formed by dividing the index of operator salary by the annual average
particulates in  the air.  The range of the index is  from . 12 to . 01. The
regression coefficient for the  index is  11.4 with a standard deviation of
4.4. Thus, for  each  . 01  change in the  index, absenteeism will drop by . 11
percent of the total population.

      With the 118 million student days under consideration, this means
that for each improvement of  . 01 in the environmental index, absenteeism  in
these four counties will drop by 133,000.  The one and two standard
deviation unit ranges respectively, are 81,000 to 185,000 and 31,000 to
235, 000  student days per . 01 improvement. Interpreting the salary index
of $2500 per increment,  an average improvement of $1000 in the salaries
of the  operators will,^9)  holding air pollution constant, reduce absenteeism
by 71, 500.

      The New  York population which is affected by the study, totals 3. 7
million persons with a working force of 1. 5 million and a total income
of $15. 6 billion. Using the environmental index comprised of the index
of operator salary divided by  average particulates, the adjusted regression
coefficient for the  working population is 7.25.  Thus,  for every .01  increase
in the  index, worker absenteeism will fall by . 07 percent of the  total
population. For the population under consideration this means a savings
in lost wages of $10, 900, 000 per average index improvement of one per-
cent. This averages $7. 5 per worker in the four county areas surrounding
New York City.

      Assuming average and unvarying particulate counts, this translates
to a total savings of  $5. 95 million dollars per year per $1000  average
increase  in operator salary.

      Medical expenses for the illnesses  under consideration in  the four
county area total 94  million dollars per year. Each $1000 average increase
in operator salaries under the above air pollution conditions will produce
a . 038    percent reduction in absenteeism, due to our subject of acute
   2. 5
diseases.   This amounts to a  savings  of 1.42 million dollars per year.  Thus,
the total savings per $1000 increase in operator  salaries is  7. 37 million
dollars per year.
59)  This is the empirically observed average.
                                  58

-------
Six Area Group-Rural

      In the other six metropolitan areas examined in the study,  we divided
our sample into urban and rural sectors.  The applicable rural population
affected is 480, 000 with 82, 000 elementary students. Both frequency of
testing of finished water and operator salaries are strong predictors.

      The regression coefficient of testing of finished water is 1. 04. The
variable is  scaled into  intervals roughly equal  to one year.  The regression
coefficient for operator salary is 1. 0.

      There are  14. 8 million  student days covered by the sample.  For each
average increase in the frequency of testing finished water  we can expect
an increase of 160, 000 student days attendance in these areas. The one
standard deviation range would be 113, 000 to 208, 000 and the two  standard
deviation range would be 65, 000 to 255, 000.

      The expected  reduction  in absenteeism resulting from increased
operator salary  is 148, 000 student days per one point average increase in
the salary index. This translates roughly  to an increase of 63, 000 student
days attendance per $1000/year increase in the average salary.  The one
and two standard deviation ranges are 31, 000 to 95, 000 and 0 to  127, 000,
respectively.

      The population of this six area group in the  study is 479, 000 with a
labor force of 170, 000 and a total income of $1". 2  billion dollars. The
primary predictor variable is the frequency of testing finished water with
an adjusted regression coefficient for the adult population of . 64. Thus,
for each one year average reduction in the  frequency of testing,absenteeism
will be reduced by . 64 percent of the population. This amounts to $7. 2
million or approximately  $45  per worker.

       The second significant variable is the salary of the operator which
has anadjusted regression coefficient of .616 .  This would lead  us to
predict a savings of 2. 9 million dollars annually for each $1000  average
increase in operator salaries. This is a savings of roughly  $18 per
worker per year.

       The related medical expenses for this group is 9. 9 million dollars
per year.

       The medical savings per average one year improvement in testing
frequency is 2. 6  million dollars per year.  A $1020 average salary increase
will produce a savings of $970, 000.
                                  59

-------
      The total savings for improving the testing procedures are  9.8
million dollars per one year increased frequency.  The total savings
resulting from a $1000 operator salary increase is 3.9 million dollars.

National Extrapolation

      The above  calculations have been based on a limited sample of 8.4
million persons,  concentrated   within eight relatively small geographical
areas.  The original CWSS study of nine areas made no claims that the
sample was representative of a  larger universe. The sample was reason-
ably diverse geographically although only three of the nine were west of
the Mississippi River.

      It included a diversity of economic and cultural characteristics and
covered a wide spectrum of community sizes.    Despite this,  they did not
choose a scientific sample from which we  can generalize to the total
population with any known degree of statistical confidence.  Thus,  it is
only prudent for  us to advise the reader that the following national estimates
are based only on the unsupported assumption that the sample has drinking
water  vs. health characteristics which are  similar to those of the entire
nation.

      We will compute the rational extrapolations  from those of our models :
(l)the general model,  (2) the Vermont model,  and (3) the six area rural
model.  The six area-urban/suburban model is too,tenuous to permit
such extrapolations.   National extrapolations from the New York model are
seriously hampered by the strong interactions with air pollution.

      The Vermont findings of the effect of interconnections with unprotected
systems is difficult to generalize given that the CWSS found that across
these total samples only  six percent of the cases  have this  problem. We
will,  therefore,  not extrapolate this portion of the Vermont results.

      The national estimates are straight  line extrapolations using the
total number of enrolled  elementary students as the basis for elementary
school absenteeism,  total wages paid as the basis for total lost wages
saved and total population for special medical costs.

Extrapolation from the General Mode]

      Recalling that in the  general model,  we found that the idea of operator
salary times the frequency of testing was  the strongest water related
variable.  The  regression coefficient was . 08, meaning that for every 1. 0
reduction in the  index value,absenteeism would be reduced by . 08 percent
+ . 08 at the . 05  confidence level.  With 5. 4 billion student days nationally,
                                  60

-------
an improvement of 1. 0 in the index would reduce elementary school
absences by 4. 3 million student days.

      Expanding this to the adult population in 1969 there, were  57. 9 million
persons  in the private sector of the labor force excluding rural  farm workers.
These persons were employed for approximately 15 billion days and earned
nearly 390 billion dollars.  Each 1. 0 reduction in the index of salary times
frequency of testing would  reduce absenteeism due to illness by 7.4 million
days +_ 7. 1 million days resulting in a savings of 194 million dollars + 186
million dollars from wages alone.  The total of 7.  4 million days of worker
illness plus 4. 3 million days of school illnesses means a reduction of
11. 7 million days in sub-clinical illness apparently resulting from drinking
water.

       The National Health Survey reports indicate  that illnesses in this
general  category cost an average of $5.46 per illness, including medicine
and physicians. We do not know the average length of illness in this category.
Assuming,  however, that it is  close to 1. 0 days per illness, and accepting
the very gross assumption of the national extrapolation,  let us estimate a
cost of $5. 00 medical costs per day of sub-clinical illness. Given one
estimate of 11. 7 +_ 11. 2 million days  illness, the medical savings are
$58. 5 million dollars +_ 56 million dollars.  From the general model,
therefore, we would estimate savings of 252 million dollars per year +_ 242
million dollars for each 1. 0  improvement in the index.

       It  should be remembered that the composition of the index makes
substantive interpretation somewhat difficult.  Since  better (more highly
paid) operators could be expected to  engage in better testing procedures,
we cannot expect that realistically we can hold testing constant  while
improving operator quality (salary).  Similarly we  probably do not expect
a general improvement in practice until municipalities decide to improve
what are clearly inadequate salaries.  Since the relationship in the general
model is greatly strengthened by the  multiplicative term, the reader must
realize that a 1. 0 increase implies a probable improvement in both
elements.

Vermont Extrapolation

      .Although we did not exclude urban areas  from the Vermont studies,
those which were included were so small relative  to the  other large
metropolitan areas  in the country, that it appeared more reasonable to
extrapolate only to the rural non-farm portions of the United States.
We are extrapolating to a working  population of 12.4 million persons
earning  70. 9 billion dollars.
                                   61

-------
      The Vermont analyses applied to 409, 000 persons and concluded that
an average increase of $1000 per year in operator's salaries would reduce
elementary  school  absenteeism by 27, 000 days per year and save 1. 7
million dollars per year in -wages and medical expenses.

      To the extent that these findings  are applicable to all rural non-farm
communities there would be  a nationwide reduction of 3. 1  million elementary
school absentee days per year per $1000 average increase in the salary
of the operators. This  is based on a nationwide  rural non-farm elementary
school population of 7. 7 million  students.

      The reduction in  lost wages would be 138 million dollars per year
and the savings in medical expenses would be 50 million dollars . Thus,
using the Vermont  model the total savings to the rural non-farm popula-
tion per average $1000 increase in operator  salary would  amount to 188
million dollars per year.

Six Area Grouping  Rural Extrapolation

      In the six area rural grouping we found that operator salary was a
significant variable with a regression  coefficient of nearly twice that of
Vermont.

      Applying those findings to  the total U.S.  rural non-farm population,
we -would predict a total reduction of 5. 5 million absentee days from
elementary schools per $1000 average increase in  operator salary. We
would also predict  a  savings of 246 million dollars per year in lost wages
and another 80 million  dollars per year in medical expenses.

      In the six area groupings,  we found that the  frequency of testing
finished water was also quite significant.  In the directly affected 480, 000
population,  we predicted a reduction of 160,000 days elementary school
absenteeism and a  savings of 7.  2 million dollars in wages and another 2. 6
million in medical  expenses for  each 1 year  average increase in the freqency
of testing of finished water

      Extending these'findings  nationwide  to all rural non-farm communities,
we would expect a  reduction of 15 million days elementary school absenteeism,
savings of 599 million  dollars in lost wages and 230 million dollars in
medical expenses.

      If we were to assume a testing improvement of frequency/month  for each,
rather than  frequency/year we would predict  a reduction of 1. 3 million days
elementary  school  absenteeism,  savings of 49 million dollars in lost wages and
19 million dollars in  medical expenses.
                                   62

-------
      Recalling the fact that improvement in salary and improvement in
testing are likely to partially co-occur,  it is not reasonable to compute
a joint savings of salary improvement and testing frequency.

      Again we would urge the reader to be cautious in interpreting
these national extrapolations.   The extrapolations are best estimates
given  existing information but we have only limited basis  for making
any statements of statistical confidence.
                                    63

-------
  Conclusions

       In concluding this report, there are  several points we would like to
  make. The first is that the existence of a relationship between drinking
  water and sub-clinical absenteeism in elementary  schools is  strongly
  supported by the study.  The results  are highly statistically significant
  consistent with our  hypotheses,  and  stable across  different sub-samples.
  Even in the weakest cases, partial correlations were above . 33, thus
  accounting for more than ten percent of the unexplained variance.

       The problem of operator salary is one which can easily be miscon-
  strued. Therefore,  it is worth reemphasizing a point made earlier.
  Although  operator's salaries are strong predictors of absenteeism,
  we  would  caution  against the conclusion that pay raises will immediately
  improve community health.  The astoundingly low salaries have resulted
  in the recruitment of generally poorly educated,  undertrained  operators.
  While in time, higher salaries will make possible  the  selection of higher
  caliber operators, in the short term, the salaries would have to be
  supplemented by vigorous operator improvement programs if any signifi-
  cant result is to be  expected.

       Therefore,  we computed benefits on  the basis of increments in
  improvements in the various measures of water quality. The various
  increments predict  reductions of 3% to 10% of the digestive and non-
  respiratory illnesses.  In the specific areas studied, this translated to
  reductions in elementary school absences on the order of 300, 000 to
  600, 000 absences per year per incremental improvements. Converting
  this to worker absenteeism using a constant of proportionality of  .62,
  this becomes  a savings in the order  of 23 million dollars per year per
  increment."0'  For the nationwide sample,  we would cautiously  estimate
  that elementary school absences would be reduced on the order of
  magnitude of 11-12 million absences per year per incremental  improve-
  ment. The wages and  medical expenses saved were in the order
  of 250 million  dollars per year  per  incremental improvement.
60) These ranges.are very broad and should be interpreted only as "ball
    park" estimates.  The more precise estimates which are presented
    in the body of the report, require breakdown by categories  of drinking
    water problem and geographic area.
                                  64

-------
       In performing the study, we made every attempt to be conservative
 in our estimates.  The medical cost figures,  for example, assume that
 there are no respiratory diseases resulting from drinking water. If they
 were transmitted  in a manner similar to gastro-intestinal illness, the
 medical cost figures would be more than twice what we have estimated.
 At all other points in the research program, when we were faced with
 two alternatives,  one of which would tend to  inflate the results and the
 other of which would tend to deflate them,  we systematically chose the
 more conservative.

       We believe that the results indicate that if improvements are made
 in the quality of drinking water,  substantial savings will be realized. As
 the CWSS report suggested and our study has confirmed, the problems are
 most severe in the rural non-farm communities.   '  There the economic
 savings alone would be most, dramatic both because the effects are re-
 latively large and the costs of resolving the problem are relatively small.
 In most cases, upgrading the personnel and procedures appears to be  all
 that is needed.
61) Rural farm populations are excluded from the conclusions because they
    were not included in either study.
                                   135

-------
                             SECTION VI
                            BIBLIOGRAPHY
Alter, Milton.   "The Distribution of Multiple Sclerosis and Environmental
Sanitation", Israel Journal of Medical Sciences,  1972,  99-105.

Barrett, Larry B.  and Waddell, Thomas  E.  "Cost of Air Pollution Damage:
A Status Report, "  Environmental Protection Agency, Research Triangle
Park, North Carolina, February 1973.

Berg, G. (ed. ).  Transmission of Viruses by the Water Route, New York:
Interscience Publishers (1967).

Borneff, J.  Public Health Aspects of Viruses  in Water,  N.  A. Clarke (ed. )
Cincinnati,  Ohio,  1 (1970), 1.

Chapman,  Robert S.  et al.  "Chronic Respiratory Disease Symptom Fre-
quency in Late Adolescence, In Parents of School Children and in Elderly
Persons, " Presented at the American Medical Association's Air Pollution
Medical Research Conference,  Chicago,  Illinois, October 1972.

Chelsky, Morris.  "The Propagating Rate:  A Method for Interpreting In-
fectious Disease Incidence".  American Journal  of Public Health,
September 1969, 1661.

Chin,  T. D.  Y. , W.  H.  Mosley,  S.  Robinson and C. R.  Gravelle, "Detec-
tion of Enteric Viruses in Sewage and Water,  Relative  Sensitivity of the
Method", Transmission of Viruses by the Water Route, G. Berg (ed. ), New
York: Interscience Publishers (1967).

Clarke,  N. A.   "Viruses in Water, " Journal of the  American Water Works
Association, 61:10 (1969),  491-494.

Cohen, A.  , et al.  "Asthma and Air Pollution from a Coal-Fueled Power
Plant, " Asthma and Air Pollution,  American Journal of Public Health,
September 1972, 1181-1.188.

Craun, Gunther and Leland McCabe. "Waterborne    Disease Outbreaks,
1961-1970",  Presented at the Annual Meeting of the American Waterworks
Association, Colorado,  June  1971.

-------
Dauer,  C. C.  "1951 Summary, Food and Waterborne-Disease Outbreaks".
Public Health Reports, 1952,  1089.

Dohan,  F. C. , G.  S.  Everts,  and R. Smith.  "Variations in Air Pollution
and the Incidence of Respiratory Disease".   Journal of the Air Pollution
Control Association, 1962. 418-436.

Eliassen, R.  and Cummings,  R.  H.   "Analysis of Waterborne Outbreaks,
1938-45",  Journal  of the  American Waterworks Association,  1948, 509.

Enviro Control, Inc.  "National Assessment of Trends in Water Quality"
Final Report submitted to Council on Environmental Quality,  June 1972.

Environmental Protection Agency, "National Water Quality Inventory Re-
port", Office  of Water Planning and  Standards,  November  1973.

Fairchild, Glen A. , Judy Roan, and James McCarroll.   "Atmospheric
Pollutants and the Pathogenesis of Viral Respiratory Infection", Arch.
Environmental Health. September 1972,  174-182.

French,  J.  G. , et  al.  "The Effect of Sulfur Dioxide  and Suspended Sul-
fates on  Acute Respiratory Disease in Children and Adults in Four Areas
of the United States, " Presented at the AMA's Air  Pollution Medical Re-
search Conference, Chicago,  October 1972.

Glass, A.  C.  and K. H.  Jenkins.  "Statistical Summary of Municipal Water
Facilities in the United States", USPHS Pub.  No. 1039. January 1958.

Gorman,  A.  E. ,  and Wolman,  A.  "Waterborne Outbreaks in  the United
States and Canada, and Their  Significance".  Journal of the American Water'
works Association,  31:225,  1939.

Harris,  Robert H.  and Edward M. Brecher, and the  Editors of Consumer
Reports.  "Is the Water Safe to Drink?" Consumer Reports,  June 1974,
436-443.

Harvard University,  "The Economics of Water Supply and Quality".  Proj-
ect No. 16110 DTF, Environmental Protection Agency,  February  1971.

Hill,  William F. , Elmer W. Akin and William H. Benton.  "Detection of
Viruses  in Water:  A Review of Methods and Application",  Proceedings of
the 13th  Water Quality Conference, Illinois 1971.
                                    67

-------
Kramer,  H.  P.  "Research Need for Drinking Water", Journal of the
American Waterworks Association.  57:1 (1965),  3-9.

Krishnaswami,  S.  K.   "Health Aspects of Water Quality", American
Journal of Public Health. Vol.  61:11. November 1971. 2259-2268.

Lackner,  Jack.  "Safe Drinking Water Act of 1973:  Estimated Benefits
and Costs",  Environmental Protection Agency, Office of Planning and
Evaluation, January 1973.

Lave.  L.  B. and E. P.  Seskin.  "Air Pollution and Human Health",
Science 169 (3947): 723-733.  August 21,  1970.

        "An Analysis  of the Association  between U. S. Mortality and Air
Pollution",  Carnegie-Mellon University, September  1970.

Liu, Oscar C, M. D.   Private Communication to Dennis Tihansky,  En-
vironmental Protection Agency, August 29,  1972.

Lyon,  W. A.   "Water and Health - Are We Concerned Enough?"  Journal
of Sanitary Engineering Division.  October  1970.

McDermott, J. H.   "Virus and Water Quality: Occurrence and Control -
Conference Summary", Environmental Protection Agency, 1971.

McKee, J.  E.  and H.  W. Wolf. "Water Quality Criteria".  California
State Water Quality Control Board,  Pub. No.  3-A,  1963.

McLean,  D. M.  et al.  "Virus Dispersal by  Water" Journal of the
American Waterworks Association.  1966,  920.

Melnick,  J.  L.  "Detection of Virus Spread  by the Water Route",  Proceed-
ings of the 13th Water  Quality Conference,  Illinois,  1971.

Metcalf.  T.  G.  "Biologic  Parameters in Water Transmission of Viruses"
Proceedings of the 13th Water  Quality Conference,  Illionis,  1971.

Mosley,  James W.  "Transmission of Viral  Diseases by the Water Route."
New York:  J.  Wiley and Sons,  1965.
                                    68

-------
New York, State of.  "Medical Fee Schedule", Workmen's Compensation
Board,  July I960.

Neri, L.  C. ,  J.  S.  Mandel, D.  Hewitt.  "Relation Between Mortality and
Water Hardness in Canada",  The Lancet. April 29, 1972, 931-934.

Ongerth,  Henry, et al.  "Public Health Aspects of Organics in Water"
Journal of the American Water Works Association,  July 1973,  495-498.

Petersen, Norman J.  and Virginia D.  Hines.  "The Relation of Summer-
time Gastroentestinal Illness to the Sanitary Quality of the Water Supplies
in Six Rocky Mountain Communities",  American Journal of Hygiene.  I960,,

Poynter,  S. F.  B.   "The Problem of Viruses in Water", Water Treatment
and Examination 1968,  187-204.

Purdue  University,  "The Utilization of Health Services:  Indices and
Correlates",  Health Services  Research and  Training Program, Indiana,
1972.

Ridker, Ronald G.   Economic Costs of Air Pollution,  New York: Frederick
A. Praeger,  1967.

Schafer, E.   Public Health Aspects of Viruses in Water. N.  A. Clarke,  (ed. ),
Cincinnati,  Ohio, 1 (1970), 14.

Schroeder,  H. A.  "Municipal Drinking Water and Cardiovascular Death
Rates", Journal of the American Dietetic Association 195:  2,  1966, 125-
129.

Shakman, Robert A.  "Nutritional Influences on the Toxicity of Environmen-
tal Pollutants," ^ff^^^^^f^^^iJlf^l^i'  February 1974,  105-113.

Shy. Carl M.  and John F. Finklea.  "Air Pollution Affects Community
Health", Environmental Science and Technology,  Volume 7:  3, March  1973,
204-208.

Shy, C. M. ,  et al.  "An Overview of CHESS".  In Health Consequences  of
Sulfur Oxides:  A Report from CHESS.  1972.

Smith,  L. S.  et al.   "Public Health Aspects  of Water Pollution Control",
Water Pollution Control,  1969,  544-549.
                                    69

-------
Social Security Administration,  "Medical Care Costs and Prices:  Back-
ground Book", Department of Health, Education and Welfare.

Stevenson, A. H.   "Studies of Bathing Water Quality and Health".
American Journal of Public Health.  43:5, 1953, 529-538.

Thompson,  Donovan,  J. et al.  "Health and the Urban Environment: Air
Pollution, Weather and the Common Cold, " American Journal of Public
Health 60:4.  p.  731-739.

Tihansky, D. "Economic Damages from Residential Use of Mineralized
Water Supply",  Water Resources Research,  April 1974.

USPHS.  Drinking Water  Standards - 1962.   P. H.  S. Publication No. 956,
1963,  61.

Viraraghavan, T.  "Water Quality and Human Health".  Journal of the
American Water Works Association,  October 1973, 647-650.

Weibel,  S. R. et al.   "Waterborne-Disease  Outbreaks, 1946-60",  Journal
of the American Water Works Association,  August 1964, 947-958.

William, D.  B.   "Viruses in Water Supply - Are Present Treatment Prac-
tices Adequate?" Paper  Presented at the Ontario Sect.  Annual Conference,
Toronto, 1971.
                                    70

-------
                          APPENDIX A

                       CALIFORNIA RESULTS


N = 32        Significance of Regression   =.99

Absenteeism = 6. 47 - . 09  Sw + 2. 6E -  1.45F

                                                 Significance

Sw =       Number of water samples
           collected/month                          .95
E  -       Education of operators                    .90

F =        Frequency of testing of water             „ 95

    The dependent variable in the California study supposedly was
unexcused absences from elementary schools. Because of this we had
originally thought that it would be a good test of  the null hypothesis
(i. e. no relationship between water quality and training).  Unfortunately,
we were reminded,  lower  income children and children of transit
parents (numerous in the San  Bernardino-Riverside area) frequently
do not have excuses even when legitimately ill.  Therefore, the measure
is quite "dirty. " By this we mean that we do not know with certainly
what we are measuring.

    The results of the regression reflect this confusion.  The two
operations variables measuring frequency of sample collection and
frequency of  testing are statistically significantly linked in the
hypothesized direction to absenteeism as though it were truly measuring
illness.   The education of the operators  is strongly linked in the
opposite direction. We  would conclude from the latter that for each
incremental improvement in education  of the operators, truancy (or
illness) would increase by 2. 6 percent  of the population.  These findings
make sense only if we interpret the dependent variable as  truancy. In
that event, truancy, which tends to be higher in  urban areas than in rural
areas, is positively correlated with operator education which also tends
to be  higher in urban areas than in rural areas.   If we assume that the
dependent measure captures variance both of truancy and of  illness,  then
the  results are somewhat reasonable.    In that  case, the  two operations
variables are capturing the illness portion of the variance while education
of the operators is capturing the truancy portion.
                                  71

-------
    Admittedly,  the argument is less than convincing,  and, to some
extent,  that supports the original purpose of the test of the California
data.  That is,  because of the dependent  variable, it is the only area
in which we have to make such  strained arguments defending the
linkage between water quality and absenteeism. In all other areas the
drinking water quality results were much, more sensible and straight-
forward. We think that this demonstrates that the  methodology itself
does not necessarily produce positive results which are statistically
significant and consistent with one hypotheses,,
                                    72

-------
                             APPENDIX B

                     The Profile of Medical Practice


                   Average Fee for Initial Office Visits
                 by Census Division and Specialty,  1970

Census Division                                   General Practice

     Total                                              $ 8. 46*

New England                                             9. 19
Middle Atlantic                                          8. 66
East North Central                                      8. 18
West North Central                                      7. 27
South Atlantic                                            8. 56
East South Central                                       7. 13
West South Central                                      7. 21
Mountain                                                7. 78
Pacific                                                 10. 67
    Average                                             7.49
    JU
    "^Based on 852 observations

Notes

      The less populated regions (with relatively low concentrations of
physicians) experienced lower levels of fees for certain office visits than
the more populated regions (•with relatively high concentrations of phy-
sicians).

      Variations in fees presumably reflect differences in the orientation
and training of physicians in various specialties, as well as patient de-
mand for their services and other factors.

      In this  study fees have been compared for a very few general cate-
gories of service.  Medical specialists differ widely in the types of service
they provide,  and the demand for these services differ among geographic
regions.  Specific comparison of fees are therefore difficult.  At the very
least,,  explanations for the marked variations require a great deal more
knowledge than is contained in the available data.  Given these considera-
tions,  assessment of variations and general trends  in physicians' fees
should be approached with caution.
Source:  Sixth Periodic Survey of Physicians.  Nov.  1970.  Center for
         Health Services Research to Development, AMA.
                                     73

-------
SELECTED WATER \ i. Report KO. 2.
RESOURCES ABSTRACTS
INPUT TRANSACTION FORM
4.
7.
9.
12.
IS.
The Health and Economic Effects of
Drinking Water
Author(s) David McCormick
Cristine Candela
Organization
Applied Science and Technology Divisa
Bendix Corporation c
First National Bldg.,201 S. Main St.";
_, -, , ,., ^,-Ann . Arbor. , , Michigan .48108 : s ,,. .
Sponsoring Organization ,
Supplementary Notes
J 3. Accession No.
j w
, 5. Report Date \
6. •'..-- •'.•;.
s- - -
j -
' S. Performing Organization
Report No.
10. Project No.
	 T.T T\ "7 o o *3 d D
WA /J KJbo
•°n 11. Contract/ Grant No.
Suite 68-01-1867 	
303 13. Type ol Report and
•:•• -••• • Period Covered

  16.  Abstract This study examines the relationship between  sub-clinical  common j
illness such as upset stomach (gastrointeritis) and other  low level  di-    j
 eases, and the quality of drinking water and water treatment plant  char-  !
acteristics.  Such illnesses account for the greatest occurrence of  health1
 roblems in the country.  For example, in 1969 the National  Health Survey  i
found that between 67 and 72 percent of disability days arising from ill-
ness fell into the sub-clinical category.
          The basic hypothesis tested was that if one accounts for all
nacro-systems influences on health, the occurrence of sub-clinical ill-
lesses will be highest in those communities with the worst water quality.
The measures of drinking water quality and water treatment plant character-
istics were obtained from a Public Health Survey study  entitled, "Community)
tfater Supply Study."  Elementary school absenteeism was used as a surrogate
neasure for sub-clinical disease.  The primary statistical tool employed   j
    stepwise multiple regression.                                          j
         A linear regression was performed of elementary school absentee-
ism onto total plate counts, measures of water treatment facilities,  and a!
series of control variables.  The number of drinking water-related absences
and their dollar value was then estimated.                                 i
                                                                         --I
  i?a. Descriptors Drinking water, waterborne disease,  epidemics,  viruses,  water
pollution control, sub-clinical illness,  low  level  illness,  school  absen-
;eeism, community health, treatment plants, operators,  socio-economic
status, urban/rural, coliforms, total plate count,  purification
facilities.

  17b. Identifiers
Immunity Water Supply Study, elementary  school  absenteeism,  water  puri-
Eication, operator
17c. CO WRR Field & Group
18. Availability
Abstractor Cristine
salary, adult population.
19. Security Cls:-s. 21 Nv. ni
(Rcpot't) Pago.
20. Securiiy Class. 22.' Price
(Page)
Send To :
WATER RESOURCES SCIENTIFIC INFORMATION CtNTUK
U.S. DEPARTMENT OF THL INTERIOR
WASHINGTON. DC 2O24O
Candela institution Bendix Corporation
WRSIC 102 ( REV. JUNE 1971)

-------