HEADQUARTERS LIBRARY
                                                      l|J|y
                                                         2 2
                                       ENVIRONMENTAL PROTECTION AGENCY
                                             , D.C. 20460
          Consumer Report
                      Text Retrieval Software
                           Open Forum
                      PostScript Cartridge for HP Printers
                     New Hewlett-Packard LaserJet III Printer
                            Report #7
                            May 1990
  EPA
  220/

  1990.8
PC Technology Assessment Program
EPA National Data Processing Division
Information Centers Branch - RICII, MD-35
Research Triangle Park, V1C 27711
Telephone: (919) 541-0568 (FTS) 629-0568
                                             Pmmdon Rocyded Paper
I

-------
 >" \
.£*;•

-------
                                                                                               221990
                                        PC 7/lP CONSUMER REPORTS
                                  HEADQUARTERS LIBRARY
                                  ENVIRONMENTAL PROTECTION AGENCY
                                  •V'5Hi:-!GTON, D.C. 20460
           From the Editor's Desk
O
           This Consumer Report focuses on text-retrieval, or bibliographic software: packages that allow you to
           search one or more files for records containing a specific keyword or text string.  As you will see, some
           of these products have other capabilities as well, like file and disk management.  We've had a number
           of requests from users for information about this family of software products, and there has been active
           user involvement in the testing and evaluation of the products we obtained for this project.  We hope you
           will find something useful to you in this issue, and that the user assessments of the group of software
           products we examined will help you select the one that's right for your purposes.

           In the  Open Forum, beginning on page 18, is a report on the new PacificPage PostScript cartridge for the
           HP LaserJet II printer.  The cartridge, which only recently  reached the marketplace, allows the non-
           PostScript LaserJet II to emulate a PostScript printer, providing most of the features and capabilities of
           PostScript devices at a fraction of the cost of  a PostScript printer.

           Another article about printers follows  Open Forum in a new section, This & That, in which we plan to
           include items of general interest or updates on topics dealt with in previous Consumer Reports.  In this
           issue,  it contains a report of a new HP printer, the LaserJet III.  This device boasts many of the features
           previously available only on true PostScript devices (which the LaserJet III is not), while it lists for $300
           less than the LaserJet II.  This & That appears on page 19.

           In the  coming months, we will be looking at some of the products on the Interim PC contract, and we'll
           be bringing you information about them in future reports. We'll also tell you about an informal PC TAP
           network we're organizing, with representatives at EPA facilities around the country, to help find people with
           the interest and expertise to assist in future assessment projects. Thanks for your continued and growing
           support for the PC Technology  Assessment Program.
                                                                                 David A. Taylor
                                                                                 PCTAPCoonBnitor
  O
  CM
HEADQUARTERS LIBRARY
ENVIRONMENTAL PROTECTION AGENCY
WASHINGTON, D.C. 20460

-------
                                Text-Retrieval Software
 Introduction
 Few of us have entered the "Information Age" without encountering an information management problem.
 As the rate and volume of publishing increases, so too does the size of our personal text information files.
 What may have begun as a small collection of text files has probably grown into a file collection of epic
 proportions!  Imagine what a task it would be to manually search those files for a specific document or
 keyword. In this scenario what we need is the capability to quickly search our files for relevant information
 and to retrieve that information as fast as we can. With all the microcomputer text-retrieval software
 products now on the market, this has become possible.

 PC TAP recently obtained seven text-retrieval software products.  These products allow you to locate
 information based on  specified  search criteria.  They're useful for textual  analysis and for  managing
 personal information flies.  For example, scientists and researchers at the EPA, who retain large amounts
 of text-based information, can use a product ike AskSam to organize,  retrieve, and maintain their text files.
 In order to leam more  about these time-saving products, we've conducted an evaluation of the following
 text-retrieval programs to see how well they meet our users' needs:
                                 •  AskSam

                                 •  Lotus Magellan
                                 •  Pro-Cite
                                 •  Reference Manager

                                 •  Sonar Professional
                                 •  WordCruncher

                                 •  ZylNDEX

Although this isn't an all-inclusive list, these products comprise a good representative sample of what's
available on the subject.  The text-retrieval software field is extremely dynamic; new packages appear on
the market and others  disappear with great frequency.  Each of these programs is PC based, with the
exception of Sonar Professional, a powerful Macintosh program.

The study evolved over a period of time, and as you'll see, some products were better received than
others.  Several study participants with a  strong interest in text-retrieval software offered evaluations of
additional products not included in our list: Guide and Hyperties. We've included these for comparison, and
hope you'll find the additional information helpful.

During the course of our study we came across a number of different terms that essentially deal with the
same category of software. Some examples: text-based management systems software, file management
software, bibliographic formatters, and text filing and retrieval systems. Furthermore, text-retrieval software
has been called disk hunting software, full text search and retrieval software, indexing software, search
software,  and search and squirt software.  Don't be confused about the terms; they're often  used
interchangeably.  Text-retrieval" software appears to be the most appropriate term, since it covers
bibliographic and full-text  retrieval of stored information.

In the pages that follow we'll explore the capabilities of text-retrieval software and discuss some innovative
ways these programs are being used  by our participants here at the EPA.  Let's first begin with a review
of the general characteristics of text-retrieval software.

-------
About Text-Retrieval Software
Text-retrieval software allows the user to retrieve text by using simple English language commands. For
example, to locate all documents referring to John Doe, initiate a search query and the system will respond
with all occurrences of that phrase. The user can then search the retrieved records for all correspondence
to Suzy Smith between January 1984 and 1989. Keying in the specified search criteria for date will satisfy
that information request.


Applications for text-retrieval software vary widely, as you'll see later in our report. However, there are some
basic evaluation criteria we can apply over a wide range of programs. Useful features include a "no frills"
text editor, Boolean search capabilities, compatibility with word processing files, a variety of input methods,
sort methods, and  automatic indexing.  Many of  the packages we examined differ in method of file
preparation, indexing, query format, search result  displays, user interface, and support. Some of the
products have special features, like  Lotus  Magellan's DOS-Shell capabilities and AskSam's retrieval of
graphics files.
                                      How They Work
Index Creation. There are two types of text-retrieval software, those that create indexes and those that
don't. For those text-retrieval packages that create indexes, a "concordance" is created for every significant
word in the stored document.  What is deemed insignificant? Generally, articles like "a" "an" or "the," in
addition to other words the program has on its "Noise Word List" or "Stopllst."  Because of their lack of
specificity, these words can not be used as search terms, so the program ignores them or treats them as
superfluous; hence, the term "noise." For example, take a look at the list for ZylNDEX; you'll see it's quite
comprehensive (Table 1  p. 4).  Many programs allow you to edit these lists by adding or deleting words.


Indexes serve an important purpose. They enable the program to rapidly identify text segments based on
query input. The program searches the index in a matter of seconds and displays the pertinent information
on screen.  Most of the programs automatically create indexes, and in ZylNDEX and WordCruncher,  for
example, files must be indexed before they can be searched.  Generally, programs that search indexes are
considerably faster than  programs in which every record is searched.
Downloading. Text-retrieval packages are useful for downloading citations and manipulating bibliographic
information.  Users of external information services, such as online databases like Dialog or BRS, may
require text-retrieval programs similar to those outlined above. Pro-Cite and Reference Manager are two
programs we examined that allow you to download bibliographic information, manipulate it, and format it
according to pre-defined styles (i.e., MLA or APA) or to your own format style. Bibliographic references
can be retrieved, sorted, and  merged with other textual information. Not all text-retrieval programs we
examined make provisions for bibliographic output. However, Pro-Cite and Reference Manager are flexible
programs that make such provisions; they perform text manipulation and retrieval.

-------
Table 1: ZylNDEX NOISE WORD LIST
•
about
after
all
also
an
and
another
any
are
at
b
be
because
been
before
being
botwoon
both
but
by
c
came
can
come
could
d
did
do
e
each
f
for
from
g
get
sot
h
ha
had
he
have
her
here
him
himself
his
how
i
if
in
into
it
j
just
k
t
like
m
mada
many
me
might
more
most
much
must
my
n
never
not
now
o
of
on
only
or
other
our
out
over
P
q
r
s
said
same
see
she
should
since
some
stil
such
t
take
than
that
the
their
them
then
there
these
they
this
those
through
to
too
u
under
up
V
very
w
was
way
we
weD
were
what
whofi
wnorB
which
while
who
with
X
would
y
you
your
z



-------
 Boolean Operators.  Information is retrieved by using simple Boolean operators or more complex search
 strings. For example, the Boolean operators "And-Or-Not" work differently to retrieve information. The
 search request:

        •  "red or white"  retrieves documents containing either of the search terms
        •  "ethics and law" retrieves documents containing both terms
        •  "cars not used" retrieves documents containing cars, but not used cars

 Advanced boolean searches involve nesting search terms within parenthesis for more complex searches,
 or specifying that two words are within a certain number of words of each other. For example, in ZylNDEX
 this would be entered as:

        •  (lion or tiger) and (circus w/2 trainer)

 The system is told to search for all documents that contain either the word lion or tiger and circus within
 two words of trainer. Text-retrieval programs have  a variety of user interfaces to deal with  Boolean
 operators.  Some programs, like Sonar Professional,  take a "dialog box" approach. All you have to do is
 "click" on the desired operator parameter; it's not necessary for you to remember the  many  complex
 commands and their input format.


 Evaluation Method	

 In keeping with PC TAP practice, our users were heavily involved in this study.  To advertise our project,
 we published a short article in the "RIC RAGG"(our Information Center newsletter) that identified the study
 and requested that those  interested in participating in the software evaluation contact us. A number of
 interested individuals responded,  and each one received an evaluation questionnaire and a software
 package or several packages to evaluate (depending on the participant's enthusiasm). No limit was set
 on how many programs each participant could evaluate. We gave each person one month to complete
 the evaluation.

 A number of people requested  the same product. For example, we found Lotus Magellan and ZylNDEX
 enjoying more popularity than the others. We circulated the products and tried to give everyone as much
 time as possible to complete their evaluation forms.
                                 The Evaluation Questionnaire

Our evaluation questionnaire was distributed to this opportunity sample of EPA employees in order to
determine the full range of software features, and to elicit their personalized responses to the software. On
the evaluation form, participants had an opportunity to provide examples of how the product has been
applied to their specific work environment. Thus, each participant was able, if he or she chose to do so,
to take a "case-study" approach to the product and its applicability. We asked for examples of how text-
retrieval software will be used in the work place, and what uses have been  made of such software in the
past. We're pleased that a number of respondents made additional comments on the forms.

Following is a  brief description of each product's main features. The products are described in terms of
ease of use, index methods, compatibility with  various software packages, search capabilities,  output
display, user interface, and sort features.  Afterwards, we'll see how well they held up to our evaluators'
expectations.

-------
 Product Descriptions
                                            AskSam

 AskSam v.4.2 is a powerful package with a number of interesting features.  The program comes with
 excellent documentation, a tutorial, and online help. AskSam's high points include a hypertext feature that
 allows global scanning for searching across multiple files, and a dialog display box allowing you to construct
 your own programming options.  This program requires extensive file preparation.  AskSam directs itself
 to free-form entry, so it differs from other programs we examined, like ZylNDEX and Lotus Magellan, that
 are designed around importing pre-formatted text files.  It's only directly compatible with ASCII and dBase
 files. Importing a WordPerfect file will work, but bear in mind that all formatting codes will be stripped.
 Unlike most of the programs we examined, AskSam does not create an index.

 AskSam has a  set of query commands for manipulating  Information. It supports a variety of search
 statements, such as Boolean and proximity searching. Its major drawback with Boolean searching is that
 it requires curly brace delimiter characters  for nesting search statements.  This is really cumbersome
 because the shift characters are difficult to type quickly, especially for a complex search.

 It's important to  understand how  the program is structured.  AskSam is organized around three different
 field types: explicit, implicit, and contextual.  ExptoYfields are similar to a database style named field, like
 a word (eg: "name" or "company") followed by a left bracket.  Implicit fields are data flags such as $ in
 $99.00. Contexfc/a/fields can be searched by a specific characteristic. For example, to search a database
 of names with occupational listings, the occupation "librarian" can be used as a contextual field marker.

 AskSam's sort feature can be used on either a simple or more complex level.  For example, in a series of
 records consisting of names and companies, you can sort by entering either NAME or COMPANY.  More
 advanced sorts and report formatting features require multi-parameter commands.

 The on-screen output is easy to follow and quite user-friendly. After a search, what shows on screen for
 each retrieved record is twenty lines of text at a time.  When  a Boolean search is specified on a command
 line, the entire screen within which the search string appears is displayed. Like Magellan, related records
 can be  linked, so you can search or print as though the related records constituted a single item.

 One of the new features in AskSam version 4.2 is its ability to retrieve graphic images.  Aside from Sonar
 Professional, none of the programs we examined had this capability. AskSam allows the user to associate
 any graphics file with an AskSam record, making it possible to retrieve and display scanned photos,
 captured video, and other digitized graphics information. Combined with its hypertext capabilities, the
 graphics support will let you build and design databases that combine text and graphics into your own
 personal retrieval system.

                                        Lotus Magellan

 Lotus Magellan is a high-powered package that will index an entire hard disk of applications and perform
 searches across applications. Our first use lor Magellan, organizing a 30-meg hard disk,  was relatively
 effortless and was accomplished with great speed.  Magellan is easy to learn to use, especially on the
 basic level. Of ail the packages, it has the most user friendly interface, is most intuitive, arid has easy-to-
foltow documentation.

Magellan requires virtually no file  preparation, it has a list of 120 stop words the user can edit by adding
or deleting words. The terms on the stop list are non-searchable. Unlike AskSam, Magellan is compatible
with a variety of packages, including database, spreadsheet, and word processing products.  It's very
                                              6

-------
              convenient to use, and the program "zooms" right in on specific text strings.  Magellan is organized in an
              easy-to-use format.  It categorizes information by file name and file content, lets you view the information
              within your files, and searches for text and data in multiple files across your hard disk.  Remember that
              before you use Magellan, you must have it index the files on your hard disk. Once this is done, you can
              issue commands to index all new and updated files.

              The program's output display is one of its best features. The screen is divided into two windows: a file list
              display is on the left, and a view of highlighted files within a larger widow is on the right. The file list can
              include all the files that meet pre-defined criteria, like all WordPerfect files or all spreadsheet files. As you
              scroll through the list of files, the window on the right changes to the exact view you'd have if you were
              actually viewing the document from within its application.

              Magellan performs a variety of  search tasks.  The EXPLORE  function is probably the first one you'll
              become familiar with. To make use of this function, enter a search phrase and Magellan locates all of the
              files that contain the search phrase and presents the filenames in ranked order in the LIST window (on the
              left side of the screen).  You can easily generate  a list of files containing a specific word.  Magellan
              performs boolean and wild card searching. You can select groups of files by application, using criteria such
              as filename, file extension, or any combination of these. Once the file list is displayed it can be sorted.
              You can also tag files for manipulations such as erasing or copying.

              Here's a short illustration of how  Magellan  operates.  Suppose  you wrote a paper on  'Information
              Technology Use in the National Library of Greece," and you want to locate the file containing the paper and
              append it to another document entitled "Current Information Technology Trends: Libraries in Greece." You
              could  use  Magellan to search for all occurrences of the words "technology" or "Greece" within all files in
              your word processing applications. You could even use Magellan to gather elements from these files and
              consolidate them into a new document.  For example, you can copy text from multiple files, regardless of
              the applications,  and combine the  text into other files.  In other words, Magellan gives you the power to
              create new documents from old ones.  In addition, you can view your files in their original directory using
              the tree display.

              One of Magellan's most useful features is its DOS shell, which can serve as a DOS front end. In some
              ways, Magellan resembles Automaxx on its more basic level.
H
                                                         Pro-Cite

              Pro-Cite, by Personal Bibliographic Software, Inc., is a  menu-driven package for managing personal
              bibliographic databases. The package consists of Pro-Cite, Pro-Search (a front-end for online searching
              of BRS and Dialog), and a range of Biblio-Links programs that assist with converting downloaded data into
              Pro-Cite compatible format.  While the package is available for both IBM and Macintosh microcomputers,
              in our report we'll only examine the IBM version.  It is important to stress that in order to use Biblio-Links
              your records must have tagged fields, like the tagged fields from DIALOG citations.

              The program assumes bibliographic information, and it comes with twenty pre-defined data structures, or
              Workforms, covering a range of print and non-print materials. These  authority lists can be called up to
              assist in  data input.   In addition, you may create up to six "User  Defined Workforms" to  your own
              specifications.  This is where the program gets its flexibility. Being able to create your own workforms
              allows you to adapt the program to other kinds of textual information. Therefore, you can  rename the
              existing fields to suit the type of information you have.

-------
 Pro-Cite gives you a variety of indexing options, such as word, author, title, or index. Index indexing allows
 you to create an index from the controlled vocabulary field, which is normally used to provide descriptors
 to give you additional search terms. In indexing, a term may be a single word or a phrase. Terms can be
 used separately or in combination to produce a dictionary list of terms. For example, author indexing would
 be used to produce a list with the author's name highlighted above the full citation.  This list could be
 created to form an authority list.

 Two search options are available:  Search and Quick Search. The Search option allows for free text, or
 field-defined, full boolean searching with left or right truncation. The search process is sequential; inverted
 indexes are not used. Quick Search is used for searches confined to the author, title, or date field, and
 is quick because only the appropriate field in each record is searched. Authority lists may be called up
 again at the  search stage, and terms (authors, publishers, journals, index terms, etc.) may be inserted
 directly into the Search Box from the Authority List box.

 Since Pro-Cite Is so  well suited to managing bibliographic information, it gives you a range of output
 formats from which to choose. Some of these are Turabian, ANSI, MLA, APA, University of Chicago, UMI,
 and Science Journal.  As with the Workforms, you can also create additional Punctuation Files (these are
 the style formats), using the Style command.

 Pro-Cite comes with documentation and a tutorial.  Compared to  the other programs surveyed, this is the
 least intuitive and has the most unwieldy and cumbersome manuals.  A UNIX version of Pro-Cite is
 scheduled for release later this year.


                                     Reference Manager

 Reference Manager  has  been  referred  to  as a "bibliography builder and manager that  facilitates
 downloading  of citation information from online services. In this way it's similar to Pro-Cite.  It's a menu-
 driven program with pre-defined fields set  up for book and journal items. The book entry has thirteen pre-
 defined fields and the journal entry format has ten.  Included are fields for author, article, journal title,
 volume number, and year.

 In a number of ways, Reference Manager is similar to Pro-Cite.  It has provisions for downloading files from
 online vendors like Medline/BRS; at the menu screen, just highlight the online service you're downloading
from. Like Pro-Cite, the program includes provisions for maintaining authority control over the terms used
 in citations. At data entry time every author, editor, keyword, and journal name that is new to the current
 Reference Manager file is marked with a flashing asterisk. This signals the user to consider possible
corrections or to check these items against lists of existing names. For example, if the author's last name,
"Jameson," is entered in the record, the asterisk will flash, prompting you to hit F3 to get an alphabetical
 list of names  and check this entry against others.

More than 100 pre-defined journal formats are available.  However, one of the strengths of this package
is the ease with which new formats can  be created.  To  create a bibliography for  a particular journal,
 Reference Manager extracts the  desired citations from the  current file, formats them in accordance with
the format of the selected journal, then places them in a new file in either sequential or alphabetical order.


                                      Sonar Professional

Sonar Professional is a user-friendly Macintosh-based software product. It's a straightforward text storage
and retrieval program that can search through documents created  by a variety of Macintosh programs, like
                                               8

-------
Microsoft Word, Microsoft Writer, Microsoft Works, WriteNow, WordPerfect, and Full Write Professional.
It also supports some desktop publishing packages like PageMaker and Ready-Set-Go! The program offers
a number of useful retrieval capabilities, and can also accept ASCII files. In its high-speed searching mode,
Sonar Professional purports to search over 10,000 pages per second!

Automatic indexing is supported.  The program creates an ASCII text file containing a list of all the words
and phrases to be indexed. The list includes one word or phrase per line, and you can print it out for easy
reference.  The index terms are in alphabetical order, and the occurrence of each term in the document
is listed for added information.   Sonar Professional is multi-finder friendly, so you can switch among
programs while the index setup is running.

The program is effective in its user interface design.  It has the standard Macintosh pull-down menus,
mouse operations, and dialog boxes.  The documentation is easy to follow, and after going through the
demo disk you have enough of a background to begin using the program.

One of Sonar Professional's  best features is its easy-to-use searching capabilities.  Searching can be
limited to specific documents or to groups of documents. The program supports exact word and phrase
retrieval, embedded character string  matching, right truncation for root  word searches, and  wildcard
searching.  "And-Or-Not" Boolean search terms can be used to create complex search  specifications by
linking several search terms together.  Proximity commands can locate search terms within a specified
number of words of one another.  Sonar Professional is easy to use because you don't have to remember
complex search terms.  You complete the search specifications by simply filling in the blanks in the dialog
box and selecting the appropriate operators. Search terms can be typed or cut and pasted from displayed
documents using the Macintosh clipboard.

Sonar Professional's output display is by paragraph.  It displays the paragraphs, with the desired search
terms, in separate windows with a title bar that indicates their page location within the documents. Search
terms are highlighted for quick identification.  Each paragraph is annotated with its  document name and
page number. The user can then select items from the displayed paragraphs and append them to the
Macintosh clipboard for later editing, printing, or other manipulation.  Like AskSam, Sonar Professional
retrieves graphics found in documents; graphics appear in a separate window.
                                        WordCruncher

WordCruncher resembles AskSam in file preparation, but differs from AskSam since it's not designed to
search across files. It's much better suited for searching one large file.  WordCruncher is not an intuitive
program, and it lacks the user-friendly atmosphere of Lotus Magellan and Sonar Professional.

The program is best suited for textual and literary analysis;  originally,  WordCruncher was conceived at
Bringham Young University for these purposes.   WordCruncher can  analyze patterns buried within a
textbase for comparative analysis. For example, you could find out if two papers were written by the same
author by analyzing word patterns and keyword occurrences.

WordCruncher is structured differently from the other programs reviewed in our study.  First of  all, the
program contains two modules, WC Index and WC View.  WC Index indexes your files, and WC View
searches them.  WC Index is used to import text files and to add  multi-level reference codes within the
text. It recognizes three levels of reference codes that identify three levels from the standard format of an
outline.  You can supply the program with lists of words that shouldn't be indexed (like ZylNDEX's list of
noise words). Groups of files can be indexed separately, and large files can be divided into segments and
then joined into massive indexes.  So WC Index really creates a "roadmap" of the file. Once the files are

-------
 indexed, they are added to the "bookshelf" and receive a .BYF extension. Think of this as an "electronic
 library," similar to arranging books manually on your bookshelf. Once you select a file and add it to the
 bookshelf, you're ready to begin searching.

 WC View, the search part of the module, allows you to locate specific references, words, phrases, lists of
 words, substrings, and contextually defined groups of words. The .BYF files contain frequency counts of
 word occurrence, and the total number of words in the document. To search your document, you have two
 choices:  you can either input the term, or scroll through the index and select a highlighted term.  Or you
 can start typing anywhere at the index, and the cursor will be placed as close to the typed term as possible.
 Once you select a term, WordCruncher displays all instances of the word when it occurs, and you can then
 browse the hits "in context."

 This "in context" display, normally shown on screen, contains several lines from a maximum of five (this
 size is adjustable) occurrences.  To select or view any occurrence, just cursor to that section and hit return
 to expand the text to full screen view.  You can then proceed to scroll the text just like you would in a word
 processor. The "hit" terms  are highBghted for easy reference.

 WordCruncher has two limitations.  We  mentioned above that it requires more file preparation than many
 of the other programs because of the three-tiered system of reference codes that provides for mufti-level
 searches.  Furthermore, WordCruncher only uses DOS text files and is not compatible with any of the
 software applications we normally use.

 This is a great program for searching large files of textual information. We've got WordCruncher installed
 in the RIC II Information Center to run with  PC SIG Library on compact disk. All you have to do is select
 a term or software name from the index list (or input your own term), and WordCruncher will search the
 entire compact disk of software program information, and then display those programs that match the
 search request.

                                           ZylNDEX

 ZylNDEX is an easy-to-use text retrieval package that indexes ASCII files and word processing files from
 popular word processors, and performs Boolean searches on those files, including proximity and nesting
 operations. It's a lot like Lotus Magellan, since it's easy to use and can function as a hard disk indexer.
 However, unlike Magellan, ZylNDEX is directly compatible only with word processors, and it supports most
 of the popular ones, including WordPerfect,  Microsoft Word, MultiMate, and WordStar. Like Pro-Cite and
 Reference Manager, it makes provisions for downloaded information from online information services.

 The first thing you'll notice when starting to use the package is that, like WordCruncher, it works in two
 separate steps: indexing and searching. In order to index the files within a directory, just enter ZYI at the
 DOS prompt  and specify the drive, directory, and file names for indexing.  ZylNDEX indexes  entries
 automatically, and allows you to create  user-editable stopiists.  Additionally, groups or types of files can
 be excluded from indexing; all you need to do is highlight the files you want indexed.  To search the
 indexed files,  enter ZYS at the DOS prompt, and you're immediately prompted to enter the search criteria
from the menu.

ZylNDEX supports exact word searches, right truncation (wildcard) for root word searches, left truncation
for suffix matches, and  retrieval specifications based on single and multi-character wildcard symbols.
 Boolean nested search (parenthetical) specifications containing up to 60 characters in length are allowed.
ZylNDEX can locate adjacent words in specified sequences with the "within" operator, thereby permitting
phrase searches.
                                              10

-------
The user interface is clear: some of the functions are menu driven, like display and mark; other operations
are command oriented.  After a search is executed, the output is displayed on screen with search terms
highlighted in displayed documents.  The page and line number are displayed with the text, and you can
easily browse through the displayed documents by scrolling the screen.

Although ZylNDEX provides facilities for field-oriented data, it's primarily useful for documents, document
abstracts, and other unstructured text segments.  The program is well suited for complex applications
because it places few significant limitations on document length.


Evaluation Summaries	

We've just discussed seven powerful packages, and although they basically carry out the same functions,
each  product has its own particular strengths and "quirks."  From speaking with our users, we found that
even  a high-powered package with  one  annoying characteristic could result in someone completely
disregarding that package. Let's take a closer look at the products and find out how well they satisfied our
users' information management needs.

                                           AskSam

Although  AskSam is a powerful and popular program, it has one  of those  annoying features that
automatically excludes it from some users' lists of programs. One of our evaluators, an experienced PC
user,  refused to look at AskSam because of its troublesome curly braces delimiter characters {} used for
nesting operations.  Any time you make an error typing, you have to go back and enter the characters
again. But don't get the wrong impression, AskSam has many strengths. Let's take a  look at how two of
our evaluators put the program to work.

As we mentioned in the beginning of our report, text-retrieval software has numerous applications in a data
processing environment.  One user,  an experienced Senior Computer Operator,  needed a program that
would store and retrieve  error codes,  phone lists, and information from operations manuals.  After
considering WordCruncher and ZylNDEX, AskSam was selected because it required less file preparation
than the other two programs and had more comprehensive documentation. It was chosen above the others
for one more reason:  its hypertext feature. The user wanted to customize the screen for this particular
operation, and AskSam allowed him to construct his own programming options.  In addition, the program
supports an amazing array of  search features, including boolean, proximity, truncation, nesting, numeric,
and multiple searches.

Another  evaluator,  a systems analyst, was investigating the use  of  text-retrieval software for the
management of metadata (data about other databases). He wanted to design and develop a text-retrieval
system that would contain information about environmental databases. AskSam is a flexible package, and
what the evaluator liked most was that the user can leave text unmodified and do straight searches for text,
or perform searches and sorts on specific categories.  AskSam can be complex, but at the same time it
can be executed without delving into  its complexities.  Although AskSam needs less file preparation than
WordCruncher, it can demand a lot  of file preparation if the  application requires many categories of
information for purposes of sorting, retrieval, and directed searches.

Two points to remember if you're considering AskSam: (1) it has a "primitive" editor, and (2) it doesn't
facilitate bibliographic output. Our evaluator combatted the editing problem by using a WordPerfect macro
to insert all special characters  into the text for defining all fields  (categories) for AskSam input.  However,
those of you with bibliographic information will undoubtedly  want to consider another package.  AskSam
makes no provision for bibliographic references and treats them as regular text.
                                             11

-------
 Generally speaking, AskSam is a great package for an experienced PC user.  It can appear overly complex
 to a novice, although with a little training you could run a "canned" AskSam application or use AskSam at
 the "bare bones" level.  Actually, you can use it at the basic level very quickly. For an application using
 no fixed fields and straight text searches, you're better off going with a more straightforward software
 package. Keep this program in mind for fixed applications, such as online manuals, online help facilities,
 or online forms with text information. Our evaluator discourages those who want to use the program to
 manage a wide variety of textual information.

                                         Lotus Magellan

 Lotus Magellan is a powerful program that's designed for hard drive users.  During installation, Magellan
 indexes the files on your hard drive and creates an index file.  Afterwards, updating the index file is easy.
 However, in some cases, users may not have a hard drive and will need to use Magellan with floppies.

 The difficulty in using Magellan with floppies, as our evaluator discovered, is in the indexing procedure.
 For example, when you use Magellan to index your floppy disk for the first time, it looks first for the index
 file, doesn't see it, and stops.  Our user found a way around this by copying the index file onto all of the
 floppies. Magellan was used to copy the index file onto the floppies, then to update the floppy before initial
 use. Once this is done, the program is easy to use. If you are using floppies with Magellan, bear in mind
 that the documentation is written for hard drive users and will be of little use.

 Using Magellan with floppies presents another inconvenience:  slower processing time.  Aside from that,
 our evaluator liked the fact that searches  and sorts use only one keystroke.  Furthermore,  creating
 subdirectories  and moving files is a  logical and intuitive procedure  in  Magellan.   Aside  from the
 inconvenience when using the  program with floppy disks, Magellan comes highly recommended. And for
 use with hard drives, it's an excellent program.

                                            Pro-Cite

 Pro-Cite is a powerful program, but the documentation is unbelievably complex and convoluted! Pro-Cite
 is great for organizing and formatting bibliographic information. Researchers, scholars, librarians, and other
 information professionals who need to maintain bibliographic information and produce bibliographies will
 find this program indispensable. In each Pro-Cite database, users can store up to 32,000 references to
 books, journal articles, dissertations, or any  other media source.

 Pro-Cite is flexible in its input methods.  References can be entered into a Pro-Cite database manually,
 converted and  transferred  from online database searches using Biblio-Links, or imported from other
 database programs.  You can then manipulate the references in a variety of ways:  they can be sorted,
 indexed, searched, and formatted automatically into bibliographies using any punctuation style. Pro-Cite
 eliminates problems with creating bibliographies because it automatically creates accurate bibliographic
 records.  The user doesn't need detailed knowledge about the majority of popular styles.

 Our evaluator,  an experienced PC user, previously used Ref-11, (another  text-retrieval product), for a
private reprint library,  and has  used MASS  11 for text searches.  Although we found the documentation
extremely cumbersome, our evaluator rated  it "good" and found the package easy to install and easy to
use on a basic level. However,  Pro-Cite couldn't incorporate data from Ref-11, so Pro-Cite does not come
recommended for those who have a system  already implemented.
                                              12

-------
                                      Reference Manager

 Reference Manager, by Research Information Systems, Inc., was welt received by both our evaluators.
 Reference Manager's "Capture Module" allows you to import citations from other systems. Although Pro-
 Cite and Reference Manager are similar, why would someone choose one over the other?  Well, our first
 evaluator, who previously worked with Pro-Cite and found it a "cumbersome and difficult program to use,"
 calls Reference Manager a "far superior" product. Keep in mind that this is an experienced PC user who's
 constantly looking for ways to use the PC.  He reported that Reference Manager contains a number of
 journal formats, has a more appealing user-interface, and can be used on a basic level more quickly than
 Pro-Cite.  This may be a result of more readable documentation.

 Our second evaluator,  an experienced PC user, uses text-retrieval products to prepare manuscripts of
 research  results. This is the first text-retrieval software with bibliographic output that he's used, and
. consequently he really enjoyed it. However, the documentation received only a satisfactory rating. Our
 evaluator found the documentation lacking in the area of Data Base Utilities.

                                      Sonar Professional

 A Macintosh expert liked this product and rated the documentation "good." As with most Macintosh
 products, the easy installation procedure and the user interface were found to be appealing.  Since our
 evaluator does not use text-retrieval software very often, these features were an added bonus.

 The menu interface is an important criterion in text-retrieval software. We, too, were amazed at how quickly
 one can begin using Sonar Professional, and at how fast it indexed a document. A most impressive feature
 is its dialog box search strategy. Someone who has a basic knowledge of logical operators can begin
 constructing complex search strings in  minutes. It's not even necessary to remember the parenthesis; all
 you have to do is click  on the right item.

 Sonar Professional  is a fast program.   Although we didn't search 10,000 pages of text to find out if it
 actually searched them in seconds, it appears to be one of the fastest programs on the market.

                                        WordCruncher

 WordCruncher is a difficult program to use, although  it has many impressive capabilities.  After examining
 the documentation and considering the extensive file  preparation necessary to use the program, our
 evaluator opted for AskSam. Another reason our evaluator decided against using WordCruncher to index
 procedures manuals is  that the reference code feature was very time consuming.  These codes provide
 additional information about the text, such  as page, chapter, and paragraph numbering.  Inserting the
 reference codes  is a tedious process, and you must remember to insert these codes in the original text
 before running WC Index. Our evaluator also rated the documentation "poor." Overall, WordCruncher is
 powerful,  but difficult to learn, and this problem is compounded by lack of adequate documentation.

 As we mentioned previously, WordCruncher is installed to run with PC SIG on CD-ROM.  We had an
 evaluator  use it to retrieve text on various software programs.  The evaluator found the processing time
 exceptional, and was impressed that WordCruncher could search the entire  PC SIG compact  disk in a
 matter of seconds!

                                           ZylNDEX

Although ZylNDEX is a powerful package, it's not as intuitive as  Lotus Magellan and can not be used
effectively without constantly consulting the manual.  For example, in order to input search queries, very
                                              13

-------
 little assistance is available to the user on screen.  We had to  refer to the manual for assistance in
 constructing appropriate search queries. This was the main feature our evaluator found fault with. In other
 words, the user who wants to input more complex searches needs to have a good understanding of search
 strategy formulation.

 It took us some time to figure out how to use proximity operators to search adjacent terms.  In ZylNDEX,
 the proximity operator "within" allows you to specify that two words occur within a certain number of words
 of each other.  For example, in order to find steeple within two words of church, you'd enter: "steeple w/2
 church"  and execute the search.  With ZylNDEX, it's not possible to limit a search to paragraphs or
 sentences.  This is a significant limitation.

 ZylNDEX is a useful product since it can,  like Magellan, serve as a hard disk indexer, and it's certainly
 more user friendly than Pro-Cite.

                               Additional Text-Retrieval Products

 Guide and Hyperties are two products one of our evaluators wishes to recommend to our readers.
 Hyperties, by Cognetics Corp., provides hypertext and full-text retrieval.  Guide, by Owl International, is a
 "hypertext authoring" program that organizes and manages free-form information.

 Both packages are good for online manuals.  Hyperties is easier to use than Guide, but not as user
 friendly.  The Windows atmosphere of Guide contributes to its more attractive user interface. The word
 search feature is easier to access than Guide's. Neither of the packages makes provisions for bibliographic
 references.

 Hyperties differs from Guide in one major area. Hyperties accommodates multi-page articles and is easier
 to use when working with  many articles.  The program  seems geared more towards multiple article
 searches while Guide is more geared towards searches within long documents.

 What did our evaluator like least about Hyperties?  First, there is no user-defined field capability, and
 second,  no sort features are supported.  With Guide, there  are some limitations with word searches.
 Furthermore, no summary information is provided about the "hits," information many users often find helpful.

                                        Looking Ahead

 Two text-retrieval programs we evaluated in our report, Lotus Magellan and WordCruncher, have
 announced updates to their products. Let's take a look at some of their newest features.

                                        Lotus Magellan

 Lotus has recently announced the arrival of  Magellan 2.0. The program promises many exciting changes.
Version 2.0 has a "point-and-shoot" feature that lets you  copy from one directory to another.  You can
 adjust keyboard speed, turn the beep on and off, and use macros to control file-group operations. Included
is an ASCII text editor, making it possible to edit ASCII text files from within the program.

We're excited with Magellan's new graphics capability. Viewing graphics files will be possible because two
new viewers allow Magellan to display 1-2-3 and Symphony .PIC files, and the  .DRW files of Lotus
Freelance Plus and Lotus Graphwriter II. Magellan 2.0 also can restore deleted files, automatically update
your index after a specified period of keyboard inactivity, and search new files without indexing them first.
                                              14

-------
                                        WordCruncher

We're very pleased with WordCruncher 5.0.  We took a look at the Demo and were quite impressed with
its new features. WordCruncher 5.0 isn't an off-the-shelf product, as is 4.3; it's licensed to institutions for
their in-house use or for resale purposes.  Version 5.0 operates under Microsoft Windows, and it boasts
enhanced capabilities in the areas of:

                                  •  search and retrieval
                                  •  multi-lingual text searches
                                  •  graphics retrieval
                                  •  hyperlink features
                                  *  program support


Searching text has become much easier. Under Microsoft Windows, you can select a single word from the
text window by double clicking on the chosen word. This immediately positions you at the word in the text
segment, displaying page number, paragraph number, and highlighted search terms.  Search strategy
formulation resembles Sonar Professional's dialog box interface. Rather than  having to remember search
operators, you can just click on the logical connectors within the dialog box. Version 5.0 also functions as
a multi-lingual text-retrieval software package. For example, if the text you're using is in German, you can
change the menu language to the  language of your text.

Version 5.0 allows for graphic retrieval. WC Edit lets you import TIFF files and Microsoft metafiles.  New
hyperlink features from  graphics to text are available.  For example, if you retrieve a graphics image, say
a diagram of a PS/2, you can double-click on a specific item part of the diagram and hyperlink from the
graphics  image to the text location. We could go from a picture of the keyboard to text describing its
function.

We mentioned in our report that WordCruncher was weak in the  area of documentation.  Version 5.0
remedies that problem.  In the new release, the addition of context sensitive help and an enhanced user
interface  has remedied  this problem, and has resulted  in a more powerful product.


Summary	

As you've seen in our report, text-retrieval products are useful for a number of reasons. They allow us to
manage our personal text files for fast and efficient retrieval.  Rather than manually searching through
printed documents, we are able to enter a word or  a search string command and execute a search within
seconds.  More and more, academics, information professionals, and individual PC users with large
amounts of textual information will find programs like these indispensable.
                                             15

-------
 List of Study Contributors
 Ezra Berman
 Veterinary Toxicologist
 HERL, MD-72
 Environmental Research Center
 Research Triangle Park, NC 27711
 (919)541-2784

 Donald R. Block
 Systems Analyst
 NSI Technology Service Corporation
 2 Triangle Drive
 Research Triangle Park, NC 27711
 (919) 541-2301

 Roy Huntley
 Chemical Engineer
 OAQPS, MD-19
 EPA Page Road Facility
 Research Triangle Park, N.C. 27711
 (919) 541-1060

 Sophia Jeffries
 1C Consultant/UNC-CH Graduate Assistant
 Information Centers Branch, MD-35
 EPA National Computer Center
 Research Triangle Park, N.C. 27711
 (919) 541-3661

 Len Miiich
 Technical Writer
 Computer Sciences Corporation, MD-43
 EPA Environmental Research Center
 Research Triangle Park, NC 27711
 (919) 541-3500

Theresa Rhyne
 Information Center Consultant
 Information Centers Branch, MD-35
 EPA National Computer Center
 Research Triangle Park, NC 27711
(919) 541-0207

Sanjay Satsangi
Senior Computer Operator
EPA National Computer Center, MD-24
Research Triangle Park, NC 27711
(919) 541-1035
                                           16

-------
List of Study Contributors
Ralph Smiatowicz
Research Microbiotogist
HERL, MD-82
Environmental Research Center
Research Triangle Park, NC 27711
(919) 541-5776

Libby Smith
NDPD Telecommunications Branch, MD-34
EPA National Computer Center
Research Triangle Park, NC 27711
(919) 541-2795

                                         17

-------
                                       Open Forum

               Open  Forum provides an opportunity for users to share with others their own
               Innovations, or the results of their own technology assessments. The PC Technology
               Assessment Program neither verifies nor endorses the contents of Open Forum items,
               but we are pleased to offer them as a service to users.

               PacIficPage:  PostScript Language Emulation Cartridge
 The RICII Information Center recently obtained a copy of Pacific Data Product's new PostScript emulation
 cartridge. This cartridge allows the HP LaserJet Series II to emulate a PostScript printer, making it possible
 to print a wide range of typefaces and point sizes.  You must have a minimum of two megabytes of
 memory to use the cartridge. The PacificPage cartridge is compatible with all major application programs
 that support PostScript.

 The cartridge is easy to install. Follow these five basic steps:

               1.  Turn off the printer
               2.  Insert the cartridge in the left slot of the printer
               3.  Turn the printer on
               4.  From WordPerfect 5.1 select a PostScript printer as your output device
               5.  Execute the print command

 Preliminary tests were run using WordPerfect 5.1 documents containing large point-size selections (255
 point Helvetica), and landscape orientation. The cartridge performed well, and printing speed was relatively
fast. We also ran two separate tests, comparing output quality and speed between the LaserJet Series
 II with the PacificPage cartridge and the QMS PS810 PostScript printer. The test images appear on the
 next page.

 In the first test, a 12k document was printed; it took 2.5 minutes with PacificPage, and one minute on the
QMS PS810 (Figure on left). In our second test, we used a document containing rotated text and graphics,
with a file size of 8268k. The document took 30 seconds to print on the QMS PS810, and 85 seconds with
the PacificPage cartridge (Figure on  right).

Although the QMS PS810 is faster, for those who don't own a PostScript printer, there are a number of
reasons to consider purchasing the PacificPage cartridge.  Not only are 35 different fonts available, but you
can randomly select point sizes.  For example,  it's possible to specify a point size of 12.7.  In addition,
PostScript language capabilities like  scaling, rotating, reverse text, screens, and patterns, are available.
Furthermore, we found the print quafity comparable to that of our QMS PostScript printer. It's nice to be
able to purchase a cartridge that emulates a PostScript printer,  especially if buying a PostScript printer is
not feasible at the time. The PacificPage cartridge lists for $695 retail.
                                             18

-------
                                                    PC TAP OPEN FORUM
                                                8
                                                EC.
                                                *
                                                £
                                        This &  That
                                HP Introduces LaserJet III

Hewlett Packard has recently introduced the LaserJet III, a powerful printer that combines the best features
of its LaserJet II, with added PostScript printing capabilities. The LaserJet III has marked improvements
in the following areas: productivity, output quality, and ease of use.  And at $2395, it even costs $300 less
than the LaserJet II. The LaserJet III allows you to print rotated text, shaded images, curved lines, and an
assortment of scalable fonts.  Hewlett Packard eventually plans to offer 300 scalable fonts.

According to Hewlett Packard, the LaserJet  Ill's speeds are 46% faster than those of the LaserJet II for
complex pages with graphics, multiple typefaces, and special effects. Here are some of its key features:

               (1)  Scalable fonts
               (2)  Prints landscape and portrait on same page
               (3)  Resolution enhancements
               (4)  More useable standard RAM

Two of the  LaserJet Ill's most significant features are scalable  fonts and resolution enhancements.
Scalable fonts are vector fonts rather than bitmappped fonts.  So the quality of the fonts, whether at 10,12,
or 20 point, remains the same. Resolution enhancement, also known as "fuzzy dot technology," allows the
printer to analyze the jagged edges of text, and to fill in the corners with small gray dots.  The output
quality of characters and curved lines is significantly enhanced.

Before the LaserJet III, users had to choose between a pricey PostScript printer, and an HP LaserJet (or
compatible) with three or four fixed-size typefaces built in.  There are a'number of reasons you may
consider purchasing a LaserJet III:
                                              19

-------
               (1)  The LaserJet U is no longer manufactured
               (2)  The LaserJet III costs only $300 more than the LaserJet II
               (3)  Pending is a release by HP of an Appletalk interface to allow Macintosh
                   computers to use the HP LaserJet III printer
               (4)  Resolution has been enhanced substantially

 The LaserJet III is on order at the RTP Information Centers. We will be testing its capabilities and features,
 and we'll keep you informed about our findings.


 How to Submit Items for Open  Forum
 In keeping with the PC Technology Assessment Program's objective to have the user community actively
 involved in TAP projects, users are encouraged to submit items for inclusion in future PC TAP Consumer
 Reports.  If you have independently investigated the capabilities of a software product or a hardware
 component, we would like to hear from you. We'd also like you to share with others your solutions to any
 problems you may have encountered with a particular application or device, and about tricks, shortcuts,
 or unique applications you have devised.  Although we can't promise to publish every contribution, we will
 evaluate them all in terms of their potential interest to our readers and their conformance to the spirit and
 intent of PC TAP.

 There are no additional rules for Open Forum contributions, but here are some guidelines:

               1.  Contributions must be typed.  Our first preference is that they be
                  submitted on  a floppy disk in WordPerfect format.  If that  isn't
                  possible, the next best method is to  EMAIL  the text to  PCTAP,
                  EPA30647.  The least preferable method, but still acceptable, is to
                  mail a typewritten article to TAP at the address on the cover of this
                  publication.

              2.  The  length of your contribution will be determined somewhat by its
                  complexity. However, keep in mind that we're primarily interested in
                  the purpose of your study project and how pleased you were with the
                  results, not in the nitty-gritty details of how you did it. We will publish
                  your name, address,  and phone number  for those who want more
                  details.  Two to three pages is probably a reasonable  maximum
                  length. On the other hand, a paragraph containing a nugget that may
                  be useful to others would be equally welcome.

              3.  All material submitted by users is subject to our editing, and you will
                  not be given an opportunity to review the final manuscript before
                  publication. Sorry, you'll just have to trust us. If we have questions
                  or don't understand  any part of your text, we'll  contact  you for
                  clarification.


We hope you enjoy PC TAP Consumer Reports, and we look forward to hearing from individuate who
have insights or discoveries to share with others. Thanks for  your interest and your participation in the
PC Technology Assessment Program.

-------