![MACROMEDIA COLFUSION MX 7 - INSTALLING AND USING COLDFUSION... Manual Download Page 132](http://html1.mh-extra.com/html/macromedia/colfusion-mx-7-installing-and-using-coldfusion/colfusion-mx-7-installing-and-using-coldfusion_manual_3307073132.webp)
132
Chapter 9: Indexing Collections with Verity Spider
Specifies that Verity Spider follow and parse links, but not index, any HTML document that
contains the text of exp within the given HTML_tag. For multiple HTML_tag and exp
combinations, use multiple instances of the
-skip
option.
You can use wildcard expressions, where the asterisk (*) is for text strings and the question mark
(?) is for single characters; for example:
'/my_doc*/year199?'
In Windows, include double-quotation marks around the argument to protect the special
characters, such as the asterisk (*). On UNIX, use single-quotation marks. This is only required
when you run the indexing job from a command line. Quotation marks are not necessary within
a command file (the
-cmdfile
option).
If you use backslashes, you must double them so that they are properly escaped; for example:
C:\\test\\docs\\path
To use regular expressions, also specify the
-regexp
option.
Example 1
To skip all HTML documents that contain the word "personnel" in the Title element, while still
parsing those documents for links to other documents, use the following:
-indskip title "personnel"
Example 2
To avoid indexing directory listing pages, while still parsing the document and path links except
for the link to the parent directory, use one of the following, depending on the web server being
indexed:
•
For Netscape web servers, use the following:
-indskip title "*Index of*"
-nofollow "*parent directory*"
•
For Microsoft Internet Information Server, use the following:
-indskip a "*to parent directory*"
-nofollow "*parent directory*"
-maxdocsize
Syntax
:
-maxdocsize integer
Specifies the maximum size, in kilobytes, for documents to be indexed. Any documents larger
than the value specified by the
-maxdocsize
option are ignored.
The default is to index documents of any size.
-metafile
Type
: Web crawling only
Syntax
:
-metafile path_and_filename
Summary of Contents for COLFUSION MX 7 - INSTALLING AND USING COLDFUSION...
Page 1: ...COLDFUSION MX7 Configuring and Administering ColdFusion MX ...
Page 6: ...6 Contents ...
Page 10: ......
Page 78: ...78 Chapter 4 Web Server Management ...
Page 84: ...84 Chapter 5 Deploying ColdFusion Applications ...
Page 102: ...102 Chapter 7 Using Multiple Server Instances ...
Page 104: ......
Page 108: ...108 Chapter 8 Introducing Verity and Verity Tools ...