Content options
123
-indskip
Syntax
:
-indskip HTML_tag "exp"
Type
: Web crawling only
Specifies that Verity Spider follow and parse links, but not index, any HTML document that
contains the text of exp within the given HTML_tag. For multiple HTML_tag and exp
combinations, use multiple instances of the
-skip
option.
You can use wildcard expressions, where the asterisk (*) is for text strings and the question mark
(?) is for single characters; for example:
'/my_doc*/year199?'
In Windows, include double-quotation marks around the argument to protect the special
characters, such as the asterisk (*). On UNIX, use single-quotation marks. This is only required
when you run the indexing job from a command line. Quotation marks are not necessary within
a command file (the
-cmdfile
option).
If you use backslashes, you must double them so that they are properly escaped; for example:
C:\\test\\docs\\path
To use regular expressions, also specify the
-regexp
option.
Example
To skip all HTML documents that contain the word "personnel" in the Title element, while still
parsing those documents for links to other documents, use the following:
-indskip title "personnel"
Example
To avoid indexing directory listing pages, while still parsing the document and path links except
for the link to the parent directory, use one of the following, depending on the web server being
indexed:
•
For Netscape web servers, use the following:
-indskip title "*Index of*"
-nofollow "*parent directory*"
•
For Microsoft Internet Information Server, use the following:
-indskip a "*to parent directory*"
-nofollow "*parent directory*"
-maxdocsize
Syntax
:
-maxdocsize integer
Specifies the maximum size, in kilobytes, for documents to be indexed. Any documents larger
than the value specified by the
-maxdocsize
option are ignored.
The default is to index documents of any size.
Summary of Contents for COLDFUSION MX 61 - CONFIGURING AND ADMINISTERING COLDFUSION...
Page 1: ...Configuring and Administering ColdFusion MX...
Page 8: ...8 Contents...
Page 10: ...10 Introduction...
Page 12: ......
Page 36: ...36 Chapter 2 Basic ColdFusion MX Administration...
Page 56: ...56 Chapter 3 Data Source Management...
Page 74: ...74 Chapter 5 Administering Security...
Page 84: ......
Page 132: ...132 Chapter 9 Indexing Collections with Verity Spider...
Page 142: ...142 Chapter 10 Searching Collections with K2 Server...
Page 148: ...148 Chapter 11 Searching Collections with the rcvdk Utility...