|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
public interface IHttpCrawlerEventListener
Allows implementers to react to any crawler-specific events.
CAUTION: Implementors should not implement this interface directly.
They are strongly advised to subclass the
HttpCrawlerEventAdapter
class instead for forward compatibility.
Keep in mind that if defined as part of crawler defaults, a single instance of this listener will be shared amongst crawlers (unless overwritten).
Method Detail |
---|
void crawlerStarted(HttpCrawler crawler)
void documentRobotsTxtRejected(HttpCrawler crawler, String url, IURLFilter filter, RobotsTxt robotsTxt)
void documentURLRejected(HttpCrawler crawler, String url, IURLFilter filter)
void documentHeadersFetched(HttpCrawler crawler, String url, IHttpHeadersFetcher headersFetcher, Properties headers)
void documentHeadersRejected(HttpCrawler crawler, String url, IHttpHeadersFilter filter, Properties headers)
void documentFetched(HttpCrawler crawler, HttpDocument document, IHttpDocumentFetcher fetcher)
void documentURLsExtracted(HttpCrawler crawler, HttpDocument document)
void documentRejected(HttpCrawler crawler, HttpDocument document, IHttpDocumentFilter filter)
void documentPreProcessed(HttpCrawler crawler, HttpDocument document, IHttpDocumentProcessor preProcessor)
void documentImported(HttpCrawler crawler, HttpDocument document)
void documentPostProcessed(HttpCrawler crawler, HttpDocument document, IHttpDocumentProcessor postProcessor)
void documentCrawled(HttpCrawler crawler, HttpDocument document)
void crawlerFinished(HttpCrawler crawler)
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |