Statistical data fingerprinting and tracing data similarity of documents
US11430244B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Dec 23, 2020 |
| Grant date | Aug 30, 2022 |
| Priority date | — |
| Expiry date | Dec 23, 2040 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06V10/751
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method and computing device for statistical data fingerprinting and tracing data similarity of documents. The method comprises applying a statistical function to a subset of text in a first document thereby generating a first fingerprint; applying the statistical function to a subset of text in a second document thereby generating a second fingerprint; comparing the first fingerprint to the second fingerprint; and determining that the subset of text in the first document matches the subset of text in the second document based on the first fingerprint threshold matching the second fingerprint, wherein the statistical function is a measure of randomness of a count of each character in a subset of text against an expected distribution of said characters.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.