Custodian disambiguation and data matching
US10394852B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Mar 11, 2016 |
| Grant date | Aug 27, 2019 |
| Priority date | — |
| Expiry date | Jun 14, 2037 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06Q50/265
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Provided is a technique for matching different user representations of a person in a plurality of computer systems. The technique includes collecting information sets about user representations from a plurality of computer systems; normalizing the information sets to a unified format; grouping the information sets in the unified format into indexing buckets based on a user name using a non-phonetic algorithm; determining a similarity score for each pair of information sets in each of the indexing buckets; classifying each information set pair into a set of classes based on the similarity scores, wherein the set of classes comprises at least matches and non-matches; and using a data structure for merging information of information set pairs classified as matches.
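The steps named in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation; every concrete choice here is an assumption made for illustration: the record fields (`system`, `name`, `email`), the bucket key (first three letters of the last name token, chosen only because it is non-phonetic), the `difflib`-based similarity score, the thresholds, the extra `possible-match` class, and the union-find structure used for merging.

```python
from collections import defaultdict
from difflib import SequenceMatcher
from itertools import combinations


def normalize(record):
    """Map a raw record to a unified format: lowercase, whitespace-collapsed strings."""
    return {
        "system": record["system"],
        "name": " ".join(record.get("name", "").lower().split()),
        "email": record.get("email", "").strip().lower(),
    }


def bucket_key(name):
    """Hypothetical non-phonetic key: first three letters of the last name token."""
    tokens = name.split()
    return tokens[-1][:3] if tokens else ""


def similarity(a, b):
    """Assumed score: unweighted mean of string similarity on name and email."""
    name_sim = SequenceMatcher(None, a["name"], b["name"]).ratio()
    email_sim = SequenceMatcher(None, a["email"], b["email"]).ratio()
    return (name_sim + email_sim) / 2


def classify(score, match_at=0.85, non_match_below=0.5):
    """Assign a pair to a class; both thresholds are illustrative, not from the patent."""
    if score >= match_at:
        return "match"
    if score < non_match_below:
        return "non-match"
    return "possible-match"


class UnionFind:
    """Disjoint-set structure used here to merge records classified as matches."""

    def __init__(self, n):
        self.parent = list(range(n))

    def find(self, i):
        while self.parent[i] != i:
            self.parent[i] = self.parent[self.parent[i]]  # path halving
            i = self.parent[i]
        return i

    def union(self, i, j):
        self.parent[self.find(i)] = self.find(j)


def match_custodians(raw_records):
    """Return groups of record indices believed to describe the same person."""
    records = [normalize(r) for r in raw_records]

    # Group records into indexing buckets so only plausible pairs are compared.
    buckets = defaultdict(list)
    for idx, rec in enumerate(records):
        buckets[bucket_key(rec["name"])].append(idx)

    # Score and classify each pair within a bucket; merge the matches.
    uf = UnionFind(len(records))
    for members in buckets.values():
        for i, j in combinations(members, 2):
            if classify(similarity(records[i], records[j])) == "match":
                uf.union(i, j)

    groups = defaultdict(list)
    for idx in range(len(records)):
        groups[uf.find(idx)].append(idx)
    return sorted(groups.values())
```

Bucketing keeps the pairwise comparison from growing quadratically over the whole data set, at the cost of missing matches whose keys differ; a union-find is one simple way to make merging transitive, so that if A matches B and B matches C, all three end up in one group.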
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.