Datashare

Datashare is an open source project by the International Consortium of Investigative Journalists

Find entities

This page helps you find entities (people, organizations, locations, email addresses) in your documents.

Last updated 5 days ago

Prerequisite: Your documents must already be added to Datashare. See 'Add documents to Datashare' for Mac, Windows or Linux.

1. In the menu, under 'Tasks', click 'Entities'.

2. Click the 'Plus' button, either in the menu next to 'Entities' or at the top right of the page, or click 'Find entities' in the middle of the page.

3. Select your options:

  • Select a project where you want to find entities

  • Choose between finding names of people, organizations and locations, or finding email addresses. You cannot do both in a single task: run one task, then the other, in either order.

  • Choose a Natural Language Processing (NLP) model, that is, the software that runs the entity recognition. To add more models, check how to install them as extensions.
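
The three options above can be pictured as a small data model. The sketch below is purely illustrative and is not Datashare's API: `Target`, `EntityTask` and `plan_tasks` are hypothetical names, `"my-project"` and `"example-model"` are placeholders, and attaching the model to every task is an assumption. It only shows the rule that one task covers one kind of entity, so finding both names and email addresses means running two tasks, in either order.

```python
from dataclasses import dataclass
from enum import Enum


class Target(Enum):
    """What one entity-recognition task looks for (hypothetical names)."""
    NAMES = "people, organizations and locations"
    EMAILS = "email addresses"


@dataclass(frozen=True)
class EntityTask:
    """One 'Find entities' task: a project, a single target and an NLP model."""
    project: str
    target: Target
    model: str  # assumption: the chosen model is recorded on every task


def plan_tasks(project: str, targets: list[Target], model: str) -> list[EntityTask]:
    """Return one task per requested target, since a task covers only one target.

    Duplicate targets are dropped and the caller's order is preserved,
    because the two kinds of task can run in either order.
    """
    unique_targets = list(dict.fromkeys(targets))
    return [EntityTask(project, target, model) for target in unique_targets]


# Placeholder project and model names, for illustration only.
tasks = plan_tasks("my-project", [Target.EMAILS, Target.NAMES], "example-model")
for number, task in enumerate(tasks, start=1):
    print(f"Task {number}: find {task.target.value} in '{task.project}' with {task.model}")
```

In Datashare itself, each of these planned tasks corresponds to filling in and submitting the 'Find entities' form once.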

4. In 'Tasks' > 'Entities', watch the progress of your entity recognition.

Once the tasks are done, you can click 'Delete done tasks' to remove completed tasks from the list.

5. Explore your entities in the documents

You can start searching for entities in your documents right away, without waiting for all tasks to finish.

In the menu, click 'Search' > 'Documents', then open the 'Entities' tab of a document or use the 'Entities' filters.

Screenshots for the steps above:

  • Step 1: Datashare's Entities page with the menu's 'Entities' entry highlighted
  • Step 2: Datashare's Entities page with 3 highlights: the menu's 'Plus' button next to the 'Entities' entry, the central 'Find entities' button in the empty state, and the top right 'Plus' button
  • Step 3: Datashare's 'Find Entities' page with the whole form highlighted
  • Step 4: Datashare's Entities page with the table that lists tasks, and the entity recognition task highlighted in one line