com.norconex.collector.http.filter
Interface IURLFilter

All Superinterfaces:
Serializable
All Known Implementing Classes:
ExtensionURLFilter, RegexURLFilter

public interface IURLFilter
extends Serializable

Filter a document based on its URL, before any download of HTTP headers or content of a document.

It is highly recommended to overwrite the toString() method to representing this filter properly in human-readable form (e.g. logging). It is a good idea to include specifics of this filter so crawler users can know exactly why documents got accepted/rejected rejected if need be.

Implementors also implementing IXMLConfigurable must name their XML tag filter to ensure it gets loaded properly.

Author:
Pascal Essiembre

Method Summary
 boolean acceptURL(String url)
          Whether to accept this URL.
 

Method Detail

acceptURL

boolean acceptURL(String url)
Whether to accept this URL.

Parameters:
url - the URL to accept/reject
Returns:
true if accepted, false otherwise


Copyright © 2009-2013 Norconex Inc.. All Rights Reserved.