Patent · US Active

System and method for automated selection of best description from descriptions extracted from a plurality of data sources using numeric comparison and textual centrality measure

US10997403B1 · kind B1 · utility

0Cited by
6References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 19, 2018
Grant dateMay 4, 2021
Priority date
Expiry dateJul 15, 2039

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06V30/413
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Techniques are described for collecting descriptions of an entity from different data sources and using a numeric comparison and textual centrality measure to automatically select a best description. In one implementation, a method includes: retrieving a real property description dataset, the real property description dataset including descriptions from multiple data sources that describe the real property; extracting, from each of the descriptions, numbers that identify the property; performing a numerical comparison of the numbers extracted from each of the descriptions to determine if any descriptions needs to be discarded from further consideration; applying a text cleaning process to normalize the descriptions; and performing a textual centrality measure of remaining descriptions to determine a level agreement of each of the remaining descriptions with each of the other remaining descriptions; and using at least the textual centrality measure to select a description. The selected description may be used to populate a document.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.