Computer-based systems configured for entity resolution for efficient dataset reduction
US11113255B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Dec 8, 2020 |
| Grant date | Sep 7, 2021 |
| Priority date | — |
| Expiry date | Dec 8, 2040 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06N20/20
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
In order to facilitate entity resolution, systems and methods include a processor receiving first records associated with one or more entities, and second records associated with the one or more entities. The processor generates candidate pairs based on a similarity between first entity data and second entity data. The processor generates features for each candidate pair based on similarity measures between the first entity record and the second entity record. The processor utilizes a scoring machine learning model to determine a match score for each candidate pair based on each feature. The processor determines clusters of candidate pairs based on the match score of each feature for each candidate pair. The processor merges records of candidate pairs of each cluster into a respective entity record. The processor determines an entity associated with each entity record and updates an entity database with the entity record.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.