Patent · US Active

Reference resolution for text enrichment and normalization in mining mixed data

US8595245B2 · kind B2 · utility

9Cited by
14References
22Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJul 26, 2006
Grant dateNov 26, 2013
Priority date
Expiry dateMay 11, 2027

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/94
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method for enrichment of text which enables mixed data mining includes generating a model for structured data found in tables of a database. In the model, semantically-linked terms are associated with referents, such as field names or cell content of the fields, of the structured data. The referents may be a business object or refer to a business object. A plurality of candidate referring entities in textual data in the database, such as chunks of free text, is identified. For each candidate referring entity, a similarity measure between the candidate referring entity in the textual data and the model is computed to identify referring entities of the candidate referring entities and corresponding business objects/referents to which the referring entities refer. The textual data is enriched with information derived from the business objects.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.