These are the instructions for the Harvest version 1.2 distribution.

To use Harvest, you need to perform these steps:

  1. Unpack the distribution.
  2. Configure your httpd server.
  3. Install the WWW interface to the Broker.
  4. Run Harvest.


STEP 1: Unpacking the distribution

There are three binary distributions and one source distribution, all available from the Harvest distribution sites:

  Full Harvest source:
        harvest-src.tar.gz
  DEC Alpha running OSF/1 v3.0:
        harvest-alpha-dec-osf3.0.tar.gz
  Sun Sparc running Solaris 2.3:
        harvest-sparc-sun-solaris2.3.tar.gz
  Sun Sparc running SunOS 4.1.3_U1:
        harvest-sparc-sun-sunos4.1.3_U1.tar.gz

Create a directory in which the Harvest software will live, then change your current directory to it. To unpack a binary distribution, run this command, substituting the file name for your platform, to put the Harvest software into the current directory:

        % gzip -dc harvest-CPU-MACHINE-OS.tar.gz | (umask 022; tar xf -)

WARNING: You must extract Harvest in an empty directory; do not extract it over any older version of Harvest.
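
The empty-directory requirement can be checked before unpacking. The following is a minimal sketch in Bourne shell (sh) rather than the csh used elsewhere in these instructions; is_empty_dir is a hypothetical helper name and is not part of the Harvest distribution.

```shell
# Sketch only: is_empty_dir is a hypothetical helper, not part of Harvest.
# Usage: is_empty_dir DIR
# Succeeds (exit status 0) when DIR exists and contains no entries,
# including hidden ones.
is_empty_dir() {
    [ -d "$1" ] && [ -z "$(ls -A "$1")" ]
}

# Example use before unpacking (the archive name is a placeholder):
#   is_empty_dir . && gzip -dc harvest-CPU-MACHINE-OS.tar.gz | (umask 022; tar xf -)
```

If the check fails, the extraction command after && is simply not run.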

Finally, define the HARVEST_HOME environment variable. HARVEST_HOME is the Harvest directory that you extracted from the distribution file.

	% cd harvest-1.2
	% setenv HARVEST_HOME `pwd`
	% echo $HARVEST_HOME
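
The commands above use csh syntax. If your login shell is a Bourne-style shell (sh, ksh, bash), which has no setenv command, the same setting might look like the sketch below, run from inside the unpacked Harvest directory.

```shell
# Bourne-shell equivalent of the csh "setenv" line above.
# Assumes the current directory is the unpacked Harvest directory.
HARVEST_HOME=`pwd`
export HARVEST_HOME
echo "$HARVEST_HOME"
```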

Optional Harvest Components

You may want to install one of the optional Harvest components available from the Harvest distribution sites in the components directory. To install and use a component, follow the instructions included in the desired component distribution. As of March 1995, the following components are available:



STEP 2: Configuring your httpd server

Install an httpd server and modify its configuration file so that the /Harvest directory points to $HARVEST_HOME. You will also need to configure your httpd server so that it knows /Harvest/cgi-bin contains valid CGI programs. The sections at the end of these instructions give example configurations for the CERN httpd, NCSA httpd, GN, and Plexus servers.


STEP 3: Installing the WWW interface to the Broker

You'll need to edit $HARVEST_HOME/cgi-bin/HarvestGather.cgi. There are two variables that you may need to change, HARVEST_HOME and GZIP_PATH (the directory in which the GNU gzip program lives):
	HARVEST_HOME=/usr/local/harvest
	GZIP_PATH=/usr/local/bin
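
The two assignments can be changed in any text editor. As an alternative, here is a minimal Bourne shell sketch that rewrites them with sed. The helper name set_cgi_paths is hypothetical, and the sketch assumes each variable is assigned on its own line beginning with HARVEST_HOME= or GZIP_PATH=, as in the excerpt above; check your copy of the file before relying on it.

```shell
# Sketch only: set_cgi_paths is a hypothetical helper, not part of Harvest.
# Usage: set_cgi_paths FILE HARVEST_HOME_DIR GZIP_DIR
# Assumes FILE assigns each variable on a line of its own that starts with
# HARVEST_HOME= or GZIP_PATH=, and that neither directory name contains
# a "|", "&", or backslash character (they are special to sed here).
set_cgi_paths() {
    tmp="$1.tmp.$$"
    sed -e "s|^HARVEST_HOME=.*|HARVEST_HOME=$2|" \
        -e "s|^GZIP_PATH=.*|GZIP_PATH=$3|" "$1" > "$tmp" &&
    cat "$tmp" > "$1"    # overwrite in place so the file keeps its permissions
    status=$?
    rm -f "$tmp"
    return $status
}
```

A call might look like: set_cgi_paths "$HARVEST_HOME/cgi-bin/HarvestGather.cgi" "$HARVEST_HOME" /usr/local/bin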

Finally, you may need to edit $HARVEST_HOME/cgi-bin/BrokerQuery.pl.cgi to change the HARVEST_HOME variable near the top of that file. If your Perl binary is not /usr/local/bin/perl, you'll also need to change the first line of BrokerQuery.pl.cgi as appropriate.
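
One way to tell whether the first line of a script needs changing is to test whether the interpreter it names exists. The Bourne shell sketch below assumes the first line has the usual "#!" form (for example, #!/usr/local/bin/perl); shebang_ok is a hypothetical helper name.

```shell
# Sketch only: shebang_ok is a hypothetical helper, not part of Harvest.
# Usage: shebang_ok FILE
# Succeeds when the first line of FILE is a "#!" line and the interpreter
# it names exists and is executable.
shebang_ok() {
    interp=$(sed -n '1s/^#![ ]*\([^ ]*\).*/\1/p' "$1")
    [ -n "$interp" ] && [ -x "$interp" ]
}

# Example use:
#   shebang_ok "$HARVEST_HOME/cgi-bin/BrokerQuery.pl.cgi" || echo "fix the first line"
```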



STEP 4: Running Harvest

To run Harvest, use this command:
	% $HARVEST_HOME/RunHarvest

RunHarvest gives you the option of editing the Gatherer configuration file before it runs the Gatherer. This is how you can add workload constraints, local filesystem gathering, or other Gatherer features. See the User's Manual for the details on the Gatherer's features.

End of instructions.


Installing Harvest on CERN httpd v3.0

Requires an Exec and a Pass entry in the httpd.conf config file, e.g.:
        Exec /Harvest/cgi-bin/* Your-HARVEST_HOME/cgi-bin/*
        Pass /Harvest/* Your-HARVEST_HOME/*

If you are running the CERN server in standalone mode, it may be necessary to send it a HUP signal so that the server re-reads its configuration files.
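
Sending the HUP signal is done with the kill command. The Bourne shell sketch below assumes the server's process ID is stored in a file whose location you know; send_hup is a hypothetical helper name, and the PID file path depends on your server configuration.

```shell
# Sketch only: send_hup is a hypothetical helper, not part of Harvest
# or of any httpd server.
# Usage: send_hup PIDFILE
# Reads a process ID from PIDFILE and sends that process a HUP signal.
send_hup() {
    pid=$(cat "$1") || return 1
    kill -HUP "$pid"
}

# Example use (the path is a placeholder for your server's PID file):
#   send_hup /path/to/httpd.pid
```

If you already know the process ID, "kill -HUP" followed by that number does the same thing.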



Installing Harvest on NCSA httpd v1.3

Requires a ScriptAlias and an Alias entry in conf/srm.conf, e.g.:

       ScriptAlias /Harvest/cgi-bin/ Your-HARVEST_HOME/cgi-bin/
       Alias /Harvest/ Your-HARVEST_HOME/

WARNING: The ScriptAlias entry must appear before the Alias entry.

Alternatively, you can replace the ScriptAlias line with the following AddType line, which identifies CGI programs by filename extension:

       AddType application/x-httpd-cgi .cgi

If you are running the NCSA httpd server in standalone mode, it may be necessary to send it a HUP signal so that the server re-reads its configuration files.



Installing Harvest on GN

Harvest can be made to work with the GN server with some moderate effort by editing the source code. GN requires that all CGI programs begin with the string /CGI. Below is a partial list of files where you will need to change /Harvest/cgi-bin/ to /CGI/Harvest/cgi-bin/.

     src/broker/WWW/800Query.c
     src/broker/WWW/BrokerQuery.c
     src/broker/WWW/BrokerQuery.cf
     src/broker/WWW/HarvestGather
     src/broker/example/brokers/skeleton/admin/admin.html.in
     src/broker/example/brokers/skeleton/query-glimpse.html.in
     src/broker/example/brokers/skeleton/query-wais.html.in
     src/broker/example/brokers/soifhelp.html
     src/Harvest
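
The path change in these files can be made by hand or with sed. The following is a minimal Bourne shell sketch; gn_rewrite is a hypothetical helper name, and since the list above is only partial, other files may need the same change.

```shell
# Sketch only: gn_rewrite is a hypothetical helper, not part of Harvest or GN.
# Usage: gn_rewrite FILE...
# Rewrites /Harvest/cgi-bin/ to /CGI/Harvest/cgi-bin/ in each FILE.
# The first sed expression strips an existing /CGI prefix so that running
# the helper twice on the same file does not add a second prefix.
gn_rewrite() {
    for f in "$@"; do
        tmp="$f.tmp.$$"
        sed -e 's|/CGI/Harvest/cgi-bin/|/Harvest/cgi-bin/|g' \
            -e 's|/Harvest/cgi-bin/|/CGI/Harvest/cgi-bin/|g' "$f" > "$tmp" &&
        cat "$tmp" > "$f" || { rm -f "$tmp"; return 1; }
        rm -f "$tmp"
    done
}

# Example use, from the top of the Harvest source tree:
#   gn_rewrite src/broker/WWW/BrokerQuery.c src/broker/WWW/BrokerQuery.cf
```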

Also, you'll need to make a symbolic link for Harvest in your GN data directory, e.g.:

         % cd /your/root/data/dir
         % ln -s $HARVEST_HOME Harvest



Installing Harvest on Plexus

Harvest does not work well with the Plexus server because Plexus will not recognize /Harvest/cgi-bin/prog.cgi as a CGI program. No workaround is known, aside from modifying the Harvest source as with the GN httpd server.
