Patent · US Active

Statistical data fingerprinting and tracing data similarity of documents

US11430244B2 · kind B2 · utility

0Cited by
5References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 23, 2020
Grant dateAug 30, 2022
Priority date
Expiry dateDec 23, 2040

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06V10/751
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method and computing device for statistical data fingerprinting and tracing data similarity of documents. The method comprises applying a statistical function to a subset of text in a first document thereby generating a first fingerprint; applying the statistical function to a subset of text in a second document thereby generating a second fingerprint; comparing the first fingerprint to the second fingerprint; and determining that the subset of text in the first document matches the subset of text in the second document based on the first fingerprint threshold matching the second fingerprint, wherein the statistical function is a measure of randomness of a count of each character in a subset of text against an expected distribution of said characters.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.