Install on Windows

These pages will help you set up and install Datashare on your computer.

Install Datashare

You must have Windows 7 Service Pack 2 or any newer version.

1

Uninstall any prior standard version

Before you start, uninstall any prior standard version of Datashare if you have one installed. You can follow these steps: https://www.laptopmag.com/articles/uninstall-programs-windows-10

2

Download Datashare

Go to datashare.icij.org and click 'Download for Windows':

The file 'datashare-X.Y.Z.exe' is now downloaded. You can find it in your Downloads.

Double-click the file to run it.
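If you are not sure whether the download completed, a short script can look for the installer. The sketch below is only an illustration and is not part of Datashare. It assumes that Python 3.9 or newer is available, that your browser saves files to the 'Downloads' folder in your user directory, and that X, Y and Z in the file name are numbers. The name `find_installers` is hypothetical.

```python
import re
from pathlib import Path

# Assumed naming pattern: 'datashare-X.Y.Z.exe' with numeric X, Y and Z.
INSTALLER_PATTERN = re.compile(r"^datashare-(\d+)\.(\d+)\.(\d+)\.exe$", re.IGNORECASE)


def find_installers(downloads_dir: Path) -> list[Path]:
    """Return the Datashare installers in downloads_dir, highest version first."""
    found = []
    for entry in downloads_dir.iterdir():
        match = INSTALLER_PATTERN.match(entry.name)
        if entry.is_file() and match:
            # Compare versions as numbers so that 10.2.1 ranks above 1.3.0.
            version = tuple(int(part) for part in match.groups())
            found.append((version, entry))
    found.sort(key=lambda item: item[0], reverse=True)
    return [path for _, path in found]


if __name__ == "__main__":
    # Assumption: downloads land in the user's 'Downloads' folder.
    downloads = Path.home() / "Downloads"
    if downloads.is_dir():
        for installer in find_installers(downloads):
            print(installer)
```

If the script prints nothing, the installer is probably in another folder or the download did not finish.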

3

Allow Datashare

Because the Datashare installer is not signed, Windows shows a 'Windows protected your PC' popup and asks for your permission. Do not click 'Don't run'; click 'More info' instead:

Click 'Run anyway':

Windows asks if you want to allow the app to make changes to your device. Click 'Yes':

4

Install Datashare

In the Installer Wizard, click 'Install'. The installer downloads and installs OpenJDK 11 if it is not already installed on your device:

The following windows with progress bars will be displayed:

Choose a language and click 'OK':

5

Install Tesseract OCR

To install Tesseract OCR, keep the pre-selected options and click 'Next' on each window of its setup wizard, then click 'Install' on the 'Choose Start Menu Folder' window:

Untick 'Show README' and click 'Finish':

Finally, click 'Close' to close the installer of Tesseract OCR.

6

Install Datashare.jar

The installer now downloads Datashare.jar, which contains the back end and the front end:

When it is finished, click 'Close':

You can now start Datashare.


Start Datashare

Find the application on your computer and run it locally in your browser.

Open the Windows Start menu, at the left end of the taskbar at the bottom of your screen, and click 'Datashare'. (The numbers after 'Datashare' just indicate which version of Datashare you installed.)

A window called 'Terminal' opens and shows the progress of Datashare starting up. Keep this black window open for as long as you use Datashare.

Datashare should now automatically open in your default internet browser. If it doesn’t, type 'localhost:8080' in your browser's address bar.
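If the page does not load, it can help to check whether anything is answering at that address. The following is a minimal sketch, assuming Python 3 is installed on your computer. It is not part of Datashare, and the function name `is_reachable` is hypothetical. It only tells you whether some HTTP server responds on the port, not whether that server is Datashare.

```python
import urllib.error
import urllib.request


def is_reachable(url: str = "http://localhost:8080", timeout: float = 3.0) -> bool:
    """Return True if an HTTP server answers at url, False otherwise."""
    # An empty ProxyHandler bypasses system proxies, which should not be
    # used for an address on your own computer.
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    try:
        with opener.open(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server answered, even if with an error status.
        return True
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout or similar: nothing is listening.
        return False


if __name__ == "__main__":
    if is_reachable():
        print("A server is answering on localhost:8080.")
    else:
        print("Nothing answers on localhost:8080. Is the Terminal window still open?")
```

If nothing answers, check that the Terminal window is still open and that Datashare has finished starting.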

Datashare must be accessed from your internet browser (Firefox, Chrome, etc.), even though it works offline, without an Internet connection (see FAQ: Can I use Datashare with no internet connection?).

You can now add documents to Datashare.

Add documents to Datashare

Datashare provides a folder on your computer where you collect the documents you want to index.

1

Add documents in 'Datashare Data' folder

On your Windows desktop, you will see a folder called 'Datashare Data'.

Move or copy the documents you want to add to Datashare into this folder:
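If you have many files spread over sub-folders, copying them with a script may be easier than dragging them by hand. The sketch below is one possible approach, not a Datashare tool. It assumes that Python 3.9 or newer is installed and that the 'Datashare Data' folder sits in the 'Desktop' folder of your user directory. The function name `copy_documents` and the source folder 'my-documents' are hypothetical.

```python
import shutil
from pathlib import Path


def copy_documents(source_dir: Path, target_dir: Path) -> list[Path]:
    """Copy every file under source_dir into target_dir, keeping sub-folders."""
    copied = []
    for source in sorted(source_dir.rglob("*")):
        if source.is_file():
            destination = target_dir / source.relative_to(source_dir)
            destination.parent.mkdir(parents=True, exist_ok=True)
            shutil.copy2(source, destination)  # copy2 also keeps timestamps
            copied.append(destination)
    return copied


if __name__ == "__main__":
    # Assumptions: both paths below are examples; adjust them to your computer.
    source = Path.home() / "Documents" / "my-documents"
    target = Path.home() / "Desktop" / "Datashare Data"
    if source.is_dir() and target.is_dir():
        for path in copy_documents(source, target):
            print(path)
```

The script copies the files and does not move them, so the originals stay where they are.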

2

Launch Datashare

You will find it in your main menu:

3

In the menu, in 'Tasks', open 'Documents'

Expand the menu on the left:

In 'Tasks', open 'Documents':

On the top right, click the "Plus" button:

4

Choose your options

  • Select the project in Datashare where you want to add your documents. The Default project, which is created automatically, is selected by default.

  • Select the folder or sub-folder in your 'Datashare' directory that contains the documents you want to add. The entire 'Datashare' directory is added by default.

  • Choose the language of your documents if you don't want Datashare to guess it automatically. Note: If you also choose to extract text from images (the next option), you might need to install the appropriate language package on your system. Datashare will tell you if the language package is missing. Refer to the documentation to learn how to install language packages.

  • Extract text from images/PDFs with Optical Character Recognition (OCR). Be aware that indexing can take up to 10 times longer.

  • Skip already indexed documents if you'd like.

  • Click 'Add'

5

Watch the progress of your document addition

Two extraction tasks are now running:

  • The first is the scanning of your Datashare folder: it checks whether there are documents to analyze. It is called 'ScanTask'.

  • The second is the indexing of these files. It is called 'IndexTask'.

  • Note: It is not possible to 'Find entities' while these two tasks are still running. You won't have the entities (names of people, organizations, locations and e-mail addresses) yet. To get these, once your document addition is finished, please follow the steps to 'Find entities'.

    But you can start searching in your documents without having to wait for all tasks to be done.

You can now search documents in Datashare.
