System and method for automated selection of best description from descriptions extracted from a plurality of data sources using numeric comparison and textual centrality measure
US10997403B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Dec 19, 2018 |
| Grant date | May 4, 2021 |
| Priority date | — |
| Expiry date | Jul 15, 2039 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06V30/413
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Techniques are described for collecting descriptions of an entity from different data sources and using a numeric comparison and textual centrality measure to automatically select a best description. In one implementation, a method includes: retrieving a real property description dataset, the real property description dataset including descriptions from multiple data sources that describe the real property; extracting, from each of the descriptions, numbers that identify the property; performing a numerical comparison of the numbers extracted from each of the descriptions to determine if any descriptions needs to be discarded from further consideration; applying a text cleaning process to normalize the descriptions; and performing a textual centrality measure of remaining descriptions to determine a level agreement of each of the remaining descriptions with each of the other remaining descriptions; and using at least the textual centrality measure to select a description. The selected description may be used to populate a document.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.