ht://Dig: htdig

 htdig



ht://Dig Copyright © 1995-2000 The ht://Dig Group
Please see the file COPYING for license information.






Synopsis



 htdig [options]





 Description



Htdig retrieves HTML documents using the HTTP protocol and gathers information from these documents which can later be used to search these documents. This program can be referred to as the search robot.





Options





 -a

Use alternate work files. Tells htdig to append .work to database files, causing a second copy of the database to be built. This allows the original files to be used by htsearch during the indexing run. When used without the "-i" flag for an update dig, htdig will use any existing .work files for the databases to update.

 -c configfile

Use the specified configfile instead of the default configuration file.

 -h maxhops

Restrict the dig to documents that are at most maxhops links away from the starting document. This only works if -i is also given.
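
To illustrate what "at most maxhops links away" means, here is a minimal Python sketch of a breadth-first walk with a hop limit. It is not htdig's implementation: the function name documents_within_hops and the in-memory links mapping are hypothetical, chosen only for this example.

```python
from collections import deque

def documents_within_hops(links, start, max_hops):
    """Return {document: hop count} for documents at most max_hops links from start.

    `links` is a hypothetical mapping from a document to the documents it links to.
    """
    hops = {start: 0}
    queue = deque([start])
    while queue:
        doc = queue.popleft()
        if hops[doc] >= max_hops:
            continue  # at the hop limit: do not follow this document's links
        for target in links.get(doc, []):
            if target not in hops:  # first visit is the shortest path in a BFS
                hops[target] = hops[doc] + 1
                queue.append(target)
    return hops
```

With a chain of links a, b, c, d and a limit of 2, the walk reaches a, b and c but never visits d.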

 -i

Initial. Do not use any old databases. This is accomplished by first erasing the databases.

 -l

Stop and restart. Reads the progress of any previously interrupted dig from the log file, and writes the progress out if interrupted by a signal.

 -s

Print statistics about the dig after completion.

 -t

Create an ASCII version of the document database. This database is easy to parse with other programs so that information can be extracted from it for purposes other than searching. One could gather some interesting statistics from this database.

-u username:password

Tells htdig to send the supplied username and password with each HTTP request. The credentials will be encoded using the 'Basic' authentication scheme. There HAS to be a colon (:) between the username and password.
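
As a rough illustration of the 'Basic' scheme in general (not htdig's own code), the following Python sketch shows how a username:password string becomes an HTTP Authorization header. The function name basic_auth_header is hypothetical, and UTF-8 encoding of the credentials is an assumption made for this example.

```python
import base64

def basic_auth_header(credentials):
    """Build an HTTP Basic Authorization header line from 'username:password'."""
    # The colon separator is mandatory; split on the first one only,
    # so a password may itself contain colons.
    username, separator, password = credentials.partition(":")
    if not separator:
        raise ValueError("expected credentials in the form username:password")
    # Assumption: credentials are encoded as UTF-8 before base64 encoding.
    token = base64.b64encode(credentials.encode("utf-8")).decode("ascii")
    return "Authorization: Basic " + token
```

Note that base64 is an encoding, not encryption: anyone who can read the request can recover the password.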

 -v

Verbose mode. This increases the verbosity of the program. Using more than 2 is probably only useful for debugging purposes. The default verbose mode (using only one -v) gives a nice progress report while digging.







Files





 CONFIG_DIR/htdig.conf

The default configuration file.







See Also



htmerge, htsearch, Configuration file format, and A Standard for Robot Exclusion.


Andrew Scherpbier <andrew@contigo.com>
Last modified: $Date: 2000/02/17 22:05:21 $