There are two ways of installing Datashare on Linux :
install it standalone
install it with Docker
There is only a package for debian/ubuntu systems (.deb file).
If you want to run it standalone with another Linux distribution, you can download the latest version of the Datashare jar here : https://github.com/ICIJ/datashare/releases/latest
And adapt the following launch script : https://github.com/ICIJ/datashare/blob/master/datashare-dist/src/main/deb/bin/datashare to your environment.
Go to the Datashare's website: https://datashare.icij.org/.
2. Click the blue button 'DOWNLOAD FOR FREE'.
3. Save the debian package as a file
4. Install it with the following command line:
$ sudo apt install /dir/to/debian/package/datashare-dist_7.2.0_all.deb
5. Run Datashare with:
$ datashare -h
1. Install Docker for Linux: - https://docs.docker.com/install/linux/docker-ce/centos/
2. Don't forget (if not done during the install process) to add your user to the docker group:
sudo usermod -aG docker your-user
If you do so, you'll have to close your session and open a new one (logout and login again). After having logged in again, you can check that it is working with :
docker psCONTAINER ID IMAGE COMMAND CREATED STATUS PORTS NAMES
3. Install Docker Compose: https://docs.docker.com/compose/install/
4. Download the datashare.sh file from https://github.com/ICIJ/datashare-installer/releases/latest
5. On the next window, select 'Save File' and click 'OK'.
6. In your Terminal, you can start installing Datashare:
7. Once installation is done, open your browser and use Datashare at http://localhost:8080/#/:
8. It will :
initialize an Elasticsearch index with Datashare mapping
provide CLI to run Datashare extract, index, name finding tasks
provide a WEB GUI to run Datashare extract, index, name finding tasks, and search in the documents.