Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

  1. create new branch, e.g jira/solr-11701
  2. mvn dependency:tree on the newly released Apache Tika and MEMORIZE it
  3. upgrade all dependencies in lucene/ivy-versions.properties – make sure that they are in alphabetical order
  4. add any new licenses in solr/licenses – must include a -LICENSE-XYZ.txt and -NOTICE.txt file for every jar
  5. update anything new in solr/contrib/extraction/ivy.xml
  6. ant clean (out of nervous habit) and then run the unit tests in contrib/dataimporthandler-extras and contrib/extraction
  7. Fix any problems in the source code, and this can include XLSXResponseWriter which relies on Apache POI.
  8. ant clean-jars jar-checksums
  9. git add new .sha1 files in solr/licenses and lucene/licenses and git rm old .sha1 files
  10. ant precommit
  11. Receive immediate errors that you missed something and go back two steps; repeat ant precommit as needed, waiting 15-20 minutes each time ... if you didn't break something obvious.
  12. In my environment, ant precommit eventually ends in errors about broken links in html. This means you are successful!!!
  13. Run ant test for kicks.  Something will likely break.  Try to figure out if it is caused by anything you did or just a flaky build.  Bonus points if the test failure is reproducible and you report it/fix it.

Phase 2: Integration Testing Solr

...

  1. When everything looks good, commit your changes and submit a PR

Phase 4: Reflect, Rejoice, Work

  1. Reflect on:
    1. The tedium to get the dependencies right and the risks of not getting them right
    2. The ever present risks of jar hell by integrating Tika into Solr
    3. The seductive belief that Tika won't break Solr, when we know it will eventually, and we should really be keeping Tika out of Solr if at all possible...and yet maintain the awesome easy-to-get-started-ness of the current integration.
    Work onSOLR-7632 and/or SOLR-11721  TIKA-3093!
  2. Rejoice that Tika is being refactored out of Solr in 9x.
  3. Work towards whatever solution allows for an easy, out of the box extraction process for binary files