Permutation-based clustering of computer-generated data entries
US11693851B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Dec 15, 2020 |
| Grant date | Jul 4, 2023 |
| Priority date | — |
| Expiry date | Aug 19, 2041 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F40/284
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A computer-generated data entry is received. The computer-generated data entry is segmented into a set of tokens. A plurality of different token permutation groupings are determined. Each of the different token permutation groupings includes a different subset of tokens from the set of tokens of the computer-generated data entry. For the computer-generated data entry, a corresponding token permutation grouping identifier is determined for each grouping of the plurality of different token permutation groupings. It is determined whether the computer-generated data entry belongs to any data entry cluster among a plurality of previously identified data entry clusters based on a search performed using the token permutation grouping identifiers of the computer-generated data entry.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.