Create Ignore / Watch filter manually
As an alternative to the Auto-Filter system and the Filter-Assistant, you can create all Ignore/Watch filters manually.
Enter one filter expression per line. All filter definitions are case-insensitive; they are converted to lowercase internally.
The button "New Ignore Filter" (or "New Watch Filter") opens a helper dialog where you can enter and validate a new filter expression. The button "Test selected line" lets you validate the filter expression in the active line. The "Test filter" button tests all filter definitions by comparing the filtered text content of the new page with the filtered text content of the old page.
Each line can contain one of the following elements:
WebSite-Watcher supports three types of wildcard filters, with the restriction that only one wildcard (*) is allowed per filter expression:
*EndText
This form filters everything from the beginning of the page to the first occurrence of "EndText".
For example: *Daily News
StartText*EndText
This form filters all text areas which begin with "StartText" and end with "EndText".
For example: Downloads:*Publisher
StartText*
This form filters everything from the last occurrence of "StartText" to the end of the page.
For example: Users online*
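The three wildcard forms can be sketched in Python to make their behavior concrete. This is only an illustration, not WebSite-Watcher's actual implementation: the function name `apply_wildcard_filter` is hypothetical, and the sketch assumes that the matched "StartText" and "EndText" are removed together with the text between them.

```python
import re


def apply_wildcard_filter(text: str, expr: str) -> str:
    """Return the text that remains after one wildcard filter is applied.

    Hypothetical helper; assumes the start/end markers are filtered too.
    """
    # Filters are case-insensitive, so compare everything in lowercase.
    text = text.lower()
    expr = expr.lower()
    if expr.count("*") != 1:
        raise ValueError("exactly one wildcard is allowed per filter expression")
    start, end = expr.split("*")
    if not start and not end:
        raise ValueError("a wildcard needs a start text, an end text, or both")

    if not start:
        # *EndText: page beginning up to the first occurrence of EndText.
        pos = text.find(end)
        return text if pos < 0 else text[pos + len(end):]
    if not end:
        # StartText*: last occurrence of StartText up to the end of the page.
        pos = text.rfind(start)
        return text if pos < 0 else text[:pos]
    # StartText*EndText: every area that begins with StartText and ends with EndText.
    pattern = re.escape(start) + r"[\s\S]*?" + re.escape(end)
    return re.sub(pattern, "", text)


print(repr(apply_wildcard_filter("a Downloads: 5 Publisher b", "Downloads:*Publisher")))
```

The lowercase conversion mirrors the statement that filter definitions are converted to lowercase internally; a real page comparison would of course work on the full page text rather than a short string.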
WebSite-Watcher supports Perl 5 compatible regular expressions, which can be used to create complex filter definitions. Regular expressions must be wrapped in one of the following functions:
Filters the defined regular expression
For example: regex(\d+ downloads)
Filters only the first occurrence of the defined regular expression
For example: FirstRegex(\d+ downloads)
Filters everything from the page beginning to the first occurrence of the defined regular expression
For example: StartToRegex(\d+ visitors)
Filters everything between two regular expressions
For example: RegexToRegex(Downloads\: \d+,License\:)
Filters everything from the last occurrence of the defined regular expression to the end of the page
For example: RegexToEnd(\d+ users online)
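The five functions above can be approximated with Python's `re` module to show how they differ. This is a rough sketch under stated assumptions: the Python function names are hypothetical, Python's regular expression dialect is close to but not identical with Perl 5, and the matched text itself is assumed to be part of the filtered area.

```python
import re

FLAGS = re.IGNORECASE  # filter definitions are case-insensitive


def regex(text, pattern):
    """regex(...): filter every occurrence of the pattern."""
    return re.sub(pattern, "", text, flags=FLAGS)


def first_regex(text, pattern):
    """FirstRegex(...): filter only the first occurrence."""
    return re.sub(pattern, "", text, count=1, flags=FLAGS)


def start_to_regex(text, pattern):
    """StartToRegex(...): filter from the page beginning to the first match."""
    match = re.search(pattern, text, FLAGS)
    return text[match.end():] if match else text


def regex_to_regex(text, start, end):
    """RegexToRegex(...): filter everything between two patterns."""
    combined = rf"(?:{start})[\s\S]*?(?:{end})"
    return re.sub(combined, "", text, flags=FLAGS)


def regex_to_end(text, pattern):
    """RegexToEnd(...): filter from the last match to the end of the page."""
    matches = list(re.finditer(pattern, text, FLAGS))
    return text[:matches[-1].start()] if matches else text


print(repr(regex_to_regex("a Downloads: 40 x License: b", r"Downloads\: \d+", r"License\:")))
```

Each helper returns the text that would remain for comparison after the filter has been applied as an ignore filter.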
Finds a defined regular expression, extracts all digits from the result and compares them with a pre-defined number. This can be used, for example, to extract and compare prices, such as finding a match only when a certain price is higher than 1000.
For example: RegexCmp(\d+([,\.]\d+)* Euro;,; > 1000)
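The comparison step can be illustrated with a short Python sketch. It follows the description literally (find the match, keep only its digits, compare with a number) and is not the program's real logic: the function name `regex_cmp` and the set of supported operators are assumptions, and the middle parameter of the example above is not explained in this text, so the sketch ignores it.

```python
import operator
import re

# Assumed operator set; only ">" appears in the documented example.
_OPERATORS = {">": operator.gt, "<": operator.lt, "=": operator.eq}


def regex_cmp(text, pattern, op, number):
    """Return True if the digits of the first match compare as requested."""
    match = re.search(pattern, text, re.IGNORECASE)
    if match is None:
        return False
    digits = re.sub(r"\D", "", match.group(0))  # drop everything but digits
    if not digits:
        return False
    return _OPERATORS[op](int(digits), number)


print(regex_cmp("Price: 1500 Euro", r"\d+([,\.]\d+)* Euro", ">", 1000))
```

Because every digit is kept, a value with a decimal fraction such as "999,50" would be read as 99950 in this literal reading, so the sketch is only meaningful for whole-number amounts.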
Page content can also be ignored or watched by CSS class names.
Ignores/Watches all text content that is formatted with the defined CSS class name.
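To show what filtering by CSS class means in practice, the following Python sketch drops all text inside elements that carry a given class name, including nested elements. It is an assumption-laden illustration using the standard `html.parser` module: the names `ClassTextFilter` and `ignore_css_class` are hypothetical, the input is assumed to be reasonably well-formed HTML, and the sketch says nothing about the filter syntax WebSite-Watcher itself expects.

```python
from html.parser import HTMLParser

# Elements without a closing tag; they must not change the nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class ClassTextFilter(HTMLParser):
    """Collects page text, skipping elements with the given CSS class."""

    def __init__(self, class_name):
        super().__init__()
        self.class_name = class_name.lower()
        self.depth = 0      # > 0 while inside an ignored element
        self.parts = []

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        classes = (dict(attrs).get("class") or "").lower().split()
        if self.depth or self.class_name in classes:
            self.depth += 1

    def handle_endtag(self, tag):
        if tag in VOID_TAGS:
            return
        if self.depth:
            self.depth -= 1

    def handle_data(self, data):
        if not self.depth:
            self.parts.append(data)


def ignore_css_class(html, class_name):
    parser = ClassTextFilter(class_name)
    parser.feed(html)
    parser.close()
    return "".join(parser.parts)


print(repr(ignore_css_class('<p>Keep <span class="ad banner">drop</span> too</p>', "banner")))
```

A watch filter would be the inverse case: keeping only the text inside the matching elements instead of discarding it.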
Filters are always executed from top to bottom: the filter in the first line is executed before the filter in the second line, and so on.
Content that has been filtered by one filter is no longer available to the filter definitions in the following lines.
Example: You have defined the following two ignore filters:
Watcher
WebSite-Watcher
The first filter ignores (deletes) every occurrence of the word "Watcher". The second filter will never find a match, since "Watcher" is no longer available. The correct order for these two filters would be:
WebSite-Watcher
Watcher
Here the first filter ignores (deletes) every occurrence of "WebSite-Watcher". The second filter can then ignore all remaining occurrences of "Watcher".
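The effect of filter order can be reproduced with a few lines of Python. This is a simplified sketch that treats each ignore filter as plain text to delete; the function name `apply_ignore_filters` is hypothetical and the sample text is made up for illustration.

```python
def apply_ignore_filters(text, filters):
    """Apply plain-text ignore filters top to bottom.

    Returns the remaining text and the number of matches per filter.
    """
    text = text.lower()  # filters are case-insensitive
    match_counts = []
    for expr in filters:
        expr = expr.lower()
        match_counts.append(text.count(expr))
        text = text.replace(expr, "")  # filtered content is gone for later filters
    return text, match_counts


page = "WebSite-Watcher and Watcher"

# Wrong order: the second filter never matches.
print(apply_ignore_filters(page, ["Watcher", "WebSite-Watcher"]))

# Correct order: the longer expression is filtered first.
print(apply_ignore_filters(page, ["WebSite-Watcher", "Watcher"]))
```

In the first call the leftover "website-" fragment stays in the text, which is exactly the situation the ordering rule is meant to avoid.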
Outdated ignore filters are automatically detected and deleted after some time when they no longer find any matches. There's no need to delete ignore filters manually. That behavior can be disabled in the Tweaks section (although we do not recommend it).
Outdated watch filters are not deleted automatically; you have to maintain this kind of filter manually.
The "Test filter" feature lets you verify your filter definitions at any time by comparing the filtered text content of the new page with the filtered text content of the old page.