...
- sudo apt update
- gpg
- sudo apt install gnupg
- java
wget -qO - https://adoptopenjdk.jfrog.io/adoptopenjdk/api/gpg/key/public | sudo apt-key add -
sudo apt-get install -y software-properties-common
sudo add-apt-repository --yes https://adoptopenjdk.jfrog.io/adoptopenjdk/deb/
sudo apt-get install adoptopenjdk-8-hotspot
sudo apt-get install adoptopenjdk-11-hotspot
sudo apt-get install adoptopenjdk-14-hotspot
sudo apt-get install fontconfig (https://github.com/AdoptOpenJDK/openjdk-build/issues/693 via Dominik Stadler)
sudo apt install ttf-dejavu (same as above)
- sudo apt-get install groovy
- sudo apt-get install maven
- sudo apt-get install subversion
- sudo apt-get install git
- sudo apt-get install file
- installed docker following: https://docs.docker.com/engine/install/ubuntu/
Datasette
I semi-manually created a sqlite db that is a combination of On 12 November 2020, I ran tika-eval's Profile mode and three other tables: a complete file list, a list of mimes from new FileProfile on the corpus. This includes file type detection by Tika and by 'file' and a list of mimes from Tika. In the future, I think we should keep tika-eval's profile db separate from the mimes db., digests and file sizes.
We configured the reverse proxy for To reverse proxy /datasette:
ProxyPreserveHost On
ProxyPass /datasette http://0.0.0.0:8001
ProxyPassReverse /datasette http://0.0.0.0:8001
The .db is in /data1/publish. cd to that directory and then: docker `docker run --name datasette -d -p 8001:8001 -v `pwd`:/mnt datasetteproject/datasette datasette -p 8001 -h 0.0.0.0 /mnt/corpora-metadatafile_profiles.db --config sql_time_limit_ms:60000 :120000 --config max_returned_rows:100000 --config base_url:/datasette/`
HTTPD
/etc/apache2
public directories are symlinks in /usr/share/corpora
...