Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Info
titleTable Ordering Logic

The table is arranged

  1. First by Tool column; the sequenced order of tool invocation within a typical Nutch crawl cycle i.e., Injector, Generator, Fetch, Parse, etc.
  2. Secondly by the Metric Name column; with counter name's ordered alphabetically for the given Metric Group they belong to.
  3. alphabetically
  4. by the Metric Group; alphabetically for the given tool
  5. by Metric Name; alphabetically for the given metric group


Tool/ObjectToolMetric GroupMetric NameDescription

Fetcher
FetcherStatusbytes_downloaded
FetcherStatushitByThrougputThreshold
FetcherStatushitByTimeLimit
FetcherThreadFetcherStatusAboveExceptionThresholdInQueue


FetchItem.notCreated.redirect


outlinks_detected


outlinks_following


ProtocolStatus.getName()


redirect_count_exceeded


redirect_deduplicated

FetcherStatusrobots_denied

FetcherStatusrobots_denied_maxcrawldelay

ParserStatus

ParseStatus.majorCodes[p.getData().getStatus().getMajorCode()]



























Generator








GeneratorEXPR_REJECTED
GeneratorHOSTS_AFFECTED_PER_HOST_OVERFLOW
GeneratorINTERVAL_REJECTED
GeneratorMALFORMED_URL
GeneratorSCHEDULE_REJECTED
GeneratorSCORE_TOO_LOW
GeneratorSTATUS_REJECTED
GeneratorURLS_SKIPPED_PER_HOST_OVERFLOW




Injector
injectorurls_filtered
injectorurls_injected
injectorurls_merged
injectorurls_purged_404
injectorurls_purged_filter

...

./src/java/org/apache/nutch/parse/ParseSegment.java

./src/java/org/apache/nutch/fetcher/Fetcher.java

./src/java/org/apache/nutch/fetcher/FetcherThread.java

./src/java/org/apache/nutch/fetcher/QueueFeeder.java

...