Reference resolution for text enrichment and normalization in mining mixed data
US8595245B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jul 26, 2006 |
| Grant date | Nov 26, 2013 |
| Priority date | — |
| Expiry date | May 11, 2027 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/94
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method for enrichment of text which enables mixed data mining includes generating a model for structured data found in tables of a database. In the model, semantically-linked terms are associated with referents, such as field names or cell content of the fields, of the structured data. The referents may be a business object or refer to a business object. A plurality of candidate referring entities in textual data in the database, such as chunks of free text, is identified. For each candidate referring entity, a similarity measure between the candidate referring entity in the textual data and the model is computed to identify referring entities of the candidate referring entities and corresponding business objects/referents to which the referring entities refer. The textual data is enriched with information derived from the business objects.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.