Performing the Entity Extraction
The goal of the Entity Extraction process is to find locations in the text which contain entity information. In other words, it tries to find occurrences of person names, locations, organizations, VAT numbers, etc.
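To make "finding locations in the text" concrete, here is a minimal sketch of the idea in Python. It does not show how the product implements extraction. The VAT pattern, the `KNOWN_ORGANIZATIONS` list, and the `extract_entities` function are hypothetical and deliberately simplified; the sketch only shows that each entity is reported with its type and its character offsets in the text.

```python
import re
from typing import List, NamedTuple


class EntityMatch(NamedTuple):
    kind: str   # entity type, e.g. "organization" or "vat_number"
    text: str   # the matched text
    start: int  # offset of the first character of the match
    end: int    # offset one past the last character of the match


# Assumption: a simplified VAT shape (two capital letters, then 8-12 digits).
# Real VAT formats differ per country.
VAT_PATTERN = re.compile(r"\b[A-Z]{2}\d{8,12}\b")

# Assumption: a tiny hand-made lookup list standing in for a real dictionary.
KNOWN_ORGANIZATIONS = ["Acme Ltd", "Example Corp"]


def extract_entities(text: str) -> List[EntityMatch]:
    """Return all entity occurrences found in text, ordered by position."""
    matches = []
    for m in VAT_PATTERN.finditer(text):
        matches.append(EntityMatch("vat_number", m.group(), m.start(), m.end()))
    for name in KNOWN_ORGANIZATIONS:
        for m in re.finditer(re.escape(name), text):
            matches.append(EntityMatch("organization", m.group(), m.start(), m.end()))
    return sorted(matches, key=lambda e: e.start)


if __name__ == "__main__":
    for entity in extract_entities("Invoice from Acme Ltd, VAT DE123456789."):
        print(entity)
```

Run on a short invoice sentence, the sketch reports one organization and one VAT number, each with the offsets where it occurs in the text.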
The entity extraction finds entities in the set of documents located in the Documents folder of a Case Project. The Documents folder is a special predefined folder which contains all input documents for the entity extraction. This means that downloading bookmarks or importing documents from local disk into the Documents folder is a prerequisite for running the entity extraction process.
The Documents folder is marked with a special icon.
Whenever the Documents folder shows a greater-than sign (>) in front of its name, the folder is not up-to-date and the entity extraction needs to run.
You can run the entity extraction as follows:
- In the Workspace Navigator view, click on the case project containing the Documents folder which is not up-to-date.
- In the main menu, click on Project > Build Extraction. The entity extraction starts to run; you can monitor the progress in the Progress view.
As soon as the entity extraction has finished, the Documents folder is no longer marked with the > sign.