Entities are additional elements of structured data which can be extracted from within the verbatims. Entities include data such as monetary quantities, dates, currency codes, organisations, people, email addresses, URLs, as well as many other industry specific categories. 

 

The below screenshot shows a verbatim containing three pinned (reviewed) entities: a monetary quantity, a value date, and a trade ID:

 

 

A screenshot of a cell phone

Description automatically generated

 

Figure: An Investment Bank’s operations email verbatim containing three structured data entities: a monetary quantity, a value date, and a trade ID

 

  

Much like labels, predicted entities can be accepted, rejected, or assigned by highlighting a string of text and choosing the correct entity from the list in the modal (see here for how).  Both of these actions will provide training signals to the entity extraction model, which will improve its understanding of that entity type. 

 

Enabling entity extraction and selecting the entities to extract are confirmed either during the creation of the dataset or via the settings section in the Dataset settings page. 



Previous: Clusters     |     Next: Models