These are the instructions for the Harvest version 1.2 distribution.
To use Harvest, you need to perform these steps:
harvest-src.tar.gz
harvest-alpha-dec-osf3.0.tar.gz
harvest-sparc-sun-solaris2.3.tar.gz
harvest-sparc-sun-sunos4.1.3_U1.tar.gz
Create a directory in which the Harvest software will live, then change your current directory to it. To unpack the binary distribution, run this command to put the Harvest software into the current directory:
% gzip -dc harvest-CPU-MACHINE-OS.tar.gz | (umask 022; tar xf -)WARNING: You must extract Harvest in an empty directory; do not extract it over any older version of Harvest.
Finally, define the HARVEST_HOME environment variable. HARVEST_HOME is the Harvest directory that you extracted from distribution file.
% cd harvest-1.2 % setenv HARVEST_HOME `pwd` % echo $HARVEST_HOME
You may want to install one of the optional Harvest
components available from the Harvest
distribution sites in the components
directory. To
install and use a component, follow the instructions included in the
desired component distribution. As of March 1995, the following
components are available:
/Harvest
directory points to $HARVEST_HOME. You will
also need to configure your httpd server so that it knows
/Harvest/cgi-bin
contains valid CGI programs. Below are some
examples of how to configure various httpd servers to understand the
/Harvest
installation:
Go to the next step
$HARVEST_HOME/cgi-bin/HarvestGather.cgi
.
There are two variables that you may need to change, HARVEST_HOME
and GZIP_PATH (the directory in which the GNU gzip
program lives):
HARVEST_HOME=/usr/local/harvest GZIP_PATH=/usr/local/bin
Finally, you may need to edit
$HARVEST_HOME/cgi-bin/BrokerQuery.pl.cgi
, to change the
HARVEST_HOME variable near the top of that file. And if
your Perl binary is not /usr/local/bin/perl
then you'll
need to change the first line of BrokerQuery.pl.cgi
as
appropriate.
% $HARVEST_HOME/RunHarvestYou have the option of editing the Gatherer configuration file before it runs the Gatherer. This is how you can add workload constraints, local filesystem gatherering, or other Gatherer features. See the User's Manual for the details on the Gatherer's features.
httpd.conf
config file, e.g.:
Exec /Harvest/cgi-bin/* Your-HARVEST_HOME/cgi-bin/* Pass /Harvest/* Your-HARVEST_HOME/*
If you are running the CERN server in standalone mode, it may be necessary to send it a HUP signal so that the server re-reads its configuration files.
Requires a ScriptAlias and an Alias entry in
conf/srm.conf
, e.g.:
ScriptAlias /Harvest/cgi-bin/ Your-HARVEST_HOME/cgi-bin/ Alias /Harvest/ Your-HARVEST_HOME/
WARNING: The ScriptAlias entry must appear before the Alias entry.
Alternatively, you can substitute the ScriptAlias line with the following line using AddType to identify CGI programs by filename extension. e.g.:
AddType application/x-httpd-cgi .cgi
If you are running the NCSA httpd server in standalone mode, it may be necessary to send it a HUP signal so that the server re-reads its configuration files.
Harvest can be made to work with the GN server with some moderate effort by editing the source code. GN requires that all CGI programs begin with the string /CGI. Below is a partial list of files where you will need to change /Harvest/cgi-bin/ to /CGI/Harvest/cgi-bin/.
src/broker/WWW/800Query.c src/broker/WWW/BrokerQuery.c src/broker/WWW/BrokerQuery.cf src/broker/WWW/HarvestGather src/broker/example/brokers/skeleton/admin/admin.html.in src/broker/example/brokers/skeleton/query-glimpse.html.in src/broker/example/brokers/skeleton/query-wais.html.in src/broker/example/brokers/soifhelp.html src/Harvest
Also, you'll need to make a symbolic link for Harvest. e.g.:
% cd /your/root/data/dir % ln -s $HARVEST_HOME Harvest
Harvest does not work well with the Plexus server because Plexus
will not recognize /Harvest/cgi-bin/prog.cgi
as a CGI program.
No workaround is known, aside from modifying the Harvest source
as with the GN httpd server.