- MetExtractorProductCrawler example configuration can be found in the source:
  - allows you to specify how the crawler will run your extractor: https://svn.apache.org/repos/asf/oodt/trunk/metadata/src/main/resources/examples/extern-config.xml
- AutoDetectProductCrawler example configuration can be found in the source:
  - uses the same metadata extractor specification file as MetExtractorProductCrawler (you will have one of these for each mime-type)
  - allows you to define your mime-types, that is, give a mime-type for a given filename pattern: https://svn.apache.org/repos/asf/oodt/trunk/crawler/src/main/resources/examples/mimetypes.xml
    - your file might look something like:
      <mime-info>
        <mime-type type="product/hdf5">
          <glob pattern="*.h5"/>
        </mime-type>
      </mime-info>
  - maps your mime-types to extractors: https://svn.apache.org/repos/asf/oodt/trunk/crawler/src/main/resources/examples/mime-extractor-map.xml
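
To make the two-step lookup above concrete (filename to mime-type, then mime-type to extractor), here is a minimal Python sketch. It is an illustration only, not the crawler's actual implementation: it parses the example mime-info snippet shown above, matches filenames against the glob patterns, and then looks up an extractor config in a plain dictionary. The dictionary, the file name `hdf5-extern-config.xml`, and all function names are hypothetical stand-ins for what mime-extractor-map.xml would provide.

```python
import fnmatch
import xml.etree.ElementTree as ET

# The example mime-type definition from this page.
MIME_INFO = """
<mime-info>
  <mime-type type="product/hdf5">
    <glob pattern="*.h5"/>
  </mime-type>
</mime-info>
"""

# Hypothetical stand-in for mime-extractor-map.xml: one extractor
# specification file per mime-type (the file name is made up).
EXTRACTOR_CONFIGS = {"product/hdf5": "hdf5-extern-config.xml"}


def load_globs(xml_text):
    """Return (glob pattern, mime-type) pairs from a mime-info document."""
    root = ET.fromstring(xml_text)
    return [
        (glob.get("pattern"), mime_type.get("type"))
        for mime_type in root.findall("mime-type")
        for glob in mime_type.findall("glob")
    ]


def detect_mime_type(filename, globs):
    """Return the mime-type of the first matching glob, or None."""
    for pattern, mime_type in globs:
        # fnmatchcase keeps matching case-sensitive on every platform.
        if fnmatch.fnmatchcase(filename, pattern):
            return mime_type
    return None


def extractor_config_for(filename, globs, configs):
    """Map a filename to its extractor config via its mime-type."""
    return configs.get(detect_mime_type(filename, globs))


GLOBS = load_globs(MIME_INFO)
```

With these definitions, `extractor_config_for("granule_001.h5", GLOBS, EXTRACTOR_CONFIGS)` selects the HDF5 extractor config, while a file that matches no glob yields `None`, which corresponds to a product the crawler has no extractor for.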