arrow-left

All pages
gitbookPowered by GitBook
1 of 4

Loading...

Loading...

Loading...

Loading...

Start Datashare

Find the Datashare application on your computer and run it locally on your browser.

Once Datashare is installed, go to 'Finder' > 'Applications', and double-click on 'Datashare':

A Terminal window called 'Datashare.command' opens and describes the technical operations going on during the opening:

⇒ Important: Keep this Terminal window open as long as you use Datashare.

Once the process is done, Datashare should now automatically open in your default internet browser. If it doesn’t, type 'localhost:8080arrow-up-right' as a URL in your browser.

Datashare must be accessed from your internet browser (Firefox, Chome, etc), even though it works offline without Internet connection (see FAQ: ).

You can now .

Can I use Datashare with no internet connection?
add documents to Datashare
Datashare's homepage
Screenshot of Mac's Applications window where Datashare's logo is highlighted
Screenshot of Mac's terminal window with Datashare's starting logs
Screenshot of the homepage of Datashare, the projects' page with one project called 'Default'

Install on Mac

These pages will help you set up and install Datashare on your computer.

Install Datashare

The installer will take care of checking that your system have all the dependencies to run Datashare. Because this software use Apache Tesseractarrow-up-right (to perform Optical Character Recognition, OCR) and Mac doesn't support them out-of-box, heavy dependencies must be downloaded. If your system have none of those dependencies, the first installation of Datashare can take up to 30 minutes.

The installer will set up:

  • Xcode Command Line Tools (if neither XCode or Xcode Command Line Toolsarrow-up-right are installed)

  • Homebrew (if neither Homebrew or MacPorts are installed)

  • Apache Tesseract with MacPorts or Homebrew

  • Java JRE 17

  • Datashare executable

Note: Previous versions of this document referred to a "Docker Installer". We do not provide this installer anymore but Datashare is still and supported with Docker.

Installation fails:

  • Error while installing Homebrew or MacPorts: you can manually install first and then restart the installer.

  • "System Software from application was blocked from loading" : Check in your Mac's "System Settings" > "privacy & security" if you have a section with this mention "System software from application 'Datashare' was blocked from loading" or something similar related to Datashare. If you have this section you'll have to click "Allow" to be able to install datashare.

  • For any other issue check our or with your setup (macOs version) and installer logs (Command+L when the installer is launched and failed).

1

hashtag
Download Datashare

Go to and click 'Download for Mac'.

2

You can now .

Add documents to Datashare

Datashare provides a folder on your Mac to collect documents you want to have in Datashare.

1

hashtag
Find your Datashare folder on your Mac

Open your Mac's 'Finder' by clicking on the blue smiling icon in your Mac's 'Dock':

hashtag
Start the installer

In Finder, go to your 'Downloads' directory and double-click 'datashare-X.Y.Z.pkg':

3

hashtag
Go through the Datashare Installer

Click 'Continue', 'Install', enter your password and 'Install Software':

The installation begins. You see a progress bar. It stays a long time on "Running package scripts" because it is installing XCode Command Line Tools, MacPort, Tesseract OCR, Java Runtime Environment and finally Datashare.

You can see what it actually does by typing command+L: it will open a window which logs every action made.

In the end, you should see this screen:

You can now safely close this window.

published on the Docker Hubarrow-up-right
Homebrewarrow-up-right
Github issuesarrow-up-right
create a new onearrow-up-right
datashare.icij.orgarrow-up-right
start Datashare
Screenshot of the homepage of datashare.icij.org highlighting the 'Download for Mac' button
datashare.icij.orgarrow-up-right

On the menu bar at the top of your computer, click 'Go' and 'Home' (the house icon):

You will see a folder called 'Datashare':

If you want to quickly access it in the future, you can drag and drop it in 'Favorites' on the left of this window:

2

hashtag
Add documents to your Datashare folder on your Mac

Copy or drop the documents that you want to add to Datashare in this Datashare folder.

3

hashtag
Launch Datashare

Open your Applications. You should see Datashare. Double-click on it:

4

hashtag
In the menu, in 'Tasks', open 'Documents'

Expand the menu on the left:

Expand the menu

In 'Tasks', open 'Documents':

On the top right, click the 'Plus' button:

5

hashtag
Choose your options

  • Select the project in Datashare where you want to add your documents. The Default project, which is automatically created, is selected by default.

  • Select the folder or sub-folder on your computer in your 'Datashare' directory containing the documents you want to add. The entire 'Datashare' directory will be added by default.

  • Choose the language of your documents if you don't want Datashare to guess it automatically. Note: If you choose to also extract text from images (at the next option), you might need to install the appropriate language package on your system. Datashare will tell you if the language package is missing. Refer to the documentation to know .

  • Extract text from images/PDFs with Optical Character Recognition (OCR). Be aware the indexing can take up to 10 times longer.

  • Skip already indexed documents if you'd like.

  • Click 'Add'

6

hashtag
Watch the progress of your document addition

Two extraction tasks are now running:

  • The first is the scanning of your Datashare folder - it sees if there are documents to analyze. It is called 'Scan folders'.

  • The second is the indexing of these files. It is called 'Index documents'.

Note: It is not possible to '' while these two tasks are still running. You won't have the entities (names of people, organizations, locations and e-mail addresses) yet. To get these, once your document addition is finished, please follow the steps to '.

But you can start searching in your documents without having to wait for all tasks to be done.

You can now search documents in Datashare.

how to install language packages
Find entities
Find entities'
Open the 'Documents' page
Click the 'Plus' button
Form for adding documents
Screenshot of Mac's dock where the Finder is active in first position
Screenshot of Mac's Finder window with a dropdown menu below the 'Go' entry with the 'Home' entry highlighted
Screenshot of Mac's Home window with an arrow pointing at the 'Datashare' folder in the list
Screenshot of Mac's Home window highligting 'Datashare' entry located in the 'Favorites'
Screenshot of Mac's Applications window with an arrow pointing Datashare's logo
Screenshot of Datashare's homepage highlighting the top icon in the left menu top to expand it
Screenshot of the Downloads window on Mac showing the installer package of Datashare
Screenshot of the Mac installer's first step to install Datashare: 'Introduction'
Screenshot of the Mac installer's third step to install Datashare: 'Installation Type''
Screenshot of the Mac installer's step to install Datashare when username and password are asked
Screenshot of the Mac installer's last step to install Datashare: 'Summary' saying 'The installation was successful.'with a blue 'Close' button
Screenshot of Datashare's homepage with the left menu open highlighting the 'Documents' entry in the 'Tasks' category
Screenshot of Datashare's Documents page highlighting the 'Plus' button at the top right corner
Screenshot of Datashare's 'Add Documents' page with the form showing 5 options, a 'Reset' and an 'Add' buttons
Screenshot of Datashare's Documents page highlighting two lines in a table, one for 'Scan folders' and another one for 'Index documents'