...
```java
@Override
public void map(Text key, Writable value, Context context)
    throws IOException, InterruptedException {
  if (value instanceof Text) {
    // if its a url from the seed list
    String url = key.toString().trim();

    // remove empty string or string starting with '#'
    if (url.length() == 0 || url.startsWith("#"))
      return;

    url = filterNormalize(url);
    if (url == null) {
      context.getCounter("injector", "urls_filtered").increment(1);
      ...
```
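The final statement of the excerpt above, the call to context.getCounter("injector", "urls_filtered").increment(1), increments the urls_filtered counter in the injector counter group by 1. It runs only when filterNormalize returns null, that is, when the URL has been rejected.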
As a result, every mapper and reducer task of a Nutch job reports its own counter values, which gives us metrics for each task of any given Nutch job.
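To make the counting pattern concrete, here is a minimal, self-contained sketch in plain Java. It does not use Hadoop or Nutch classes: TaskCounters is a hypothetical stand-in for a task's counters, the filterNormalize shown here is an invented placeholder that only accepts http(s) URLs, and aggregate simply sums the per-task values. It mirrors the control flow of the excerpt above and is meant as an illustration under those assumptions, not as the way Nutch or Hadoop implements counters.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class CounterSketch {

    /** Hypothetical stand-in for one task's counters, keyed by "group.name". */
    public static final class TaskCounters {
        private final Map<String, Long> values = new TreeMap<>();

        public void increment(String group, String name, long amount) {
            values.merge(group + "." + name, amount, Long::sum);
        }

        public long get(String group, String name) {
            return values.getOrDefault(group + "." + name, 0L);
        }
    }

    /** Placeholder filter (an assumption): rejects anything that is not http(s). */
    public static String filterNormalize(String url) {
        boolean accepted = url.startsWith("http://") || url.startsWith("https://");
        return accepted ? url : null;
    }

    /** Mirrors the excerpt: skip blanks and comments, count rejected URLs. */
    public static void map(String line, TaskCounters counters) {
        String url = line.trim();
        if (url.length() == 0 || url.startsWith("#")) {
            return; // skipped lines do not touch any counter
        }
        if (filterNormalize(url) == null) {
            counters.increment("injector", "urls_filtered", 1);
        }
    }

    /** Sums the counters of several tasks into one job-level view. */
    public static TaskCounters aggregate(List<TaskCounters> tasks) {
        TaskCounters total = new TaskCounters();
        for (TaskCounters task : tasks) {
            task.values.forEach((key, value) -> total.values.merge(key, value, Long::sum));
        }
        return total;
    }

    public static void main(String[] args) {
        TaskCounters task1 = new TaskCounters();
        for (String line : Arrays.asList("http://example.org/", "ftp://example.org/file", "# comment", "")) {
            map(line, task1);
        }
        TaskCounters task2 = new TaskCounters();
        for (String line : Arrays.asList("mailto:someone@example.org", "gopher://example.org/", "https://example.org/a")) {
            map(line, task2);
        }
        TaskCounters job = aggregate(Arrays.asList(task1, task2));
        // prints "injector.urls_filtered = 3"
        System.out.println("injector.urls_filtered = " + job.get("injector", "urls_filtered"));
    }
}
```

In this sketch the first task rejects one URL and the second rejects two, so the job-level total is 3. Blank lines and lines starting with '#' return early and are therefore never counted as filtered.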
See below for details on each Nutch metric available.
Metrics Table
The table below provides a canonical, comprehensive collection of Nutch metrics. It is arranged in the order in which the tools are invoked within a typical Nutch crawl cycle, that is, Injector, Generator, Fetch, Parse, and so on.