Permutation-based clustering of computer-generated data entries
US12050600B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | May 22, 2023 |
| Grant date | Jul 30, 2024 |
| Priority date | — |
| Expiry date | May 22, 2043 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F40/284
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A computer-generated data entry is received. The computer-generated data entry is segmented into a set of tokens. A plurality of different token permutation groupings are determined. Each of the different token permutation groupings includes a different subset of tokens from the set of tokens of the computer-generated data entry. For the computer-generated data entry, a plurality of token permutation grouping identifiers associated with at least a portion of the plurality of different token permutation groupings is obtained. It is determined whether the computer-generated data entry belongs to any data entry cluster among a plurality of previously identified data entry clusters based on a search performed using the token permutation grouping identifiers of the computer-generated data entry.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.