scone.robot
Interface PageFilter


public interface PageFilter

This interface defines the functionaltity of a page filter.

Author:
Frank Wollenweber

Method Summary
 boolean filter(RobotHtmlNode robotHtmlNode, QueueEntry qe)
          When a page is loaded the filter method is called for the page.
 

Method Detail

filter

boolean filter(RobotHtmlNode robotHtmlNode,
               QueueEntry qe)
When a page is loaded the filter method is called for the page. Filter gets a robotHtmlNode and can use the getAttribute method to get the attributes attached to the robotHtmlNode. This method decides, if the crawling should be continued or not. The values of all filters are combined with the and operator

Parameters:
robotHtmlNode - to filter
qe - The entry of the queue which causes the robot to download robotHtmlNode. This parameter gives the filter access to the RobotTask and the current scanning-depth.
Returns:
True, if the crawling should be continued.