Patent · US Active

Forensic analysis using synthetic datasets

US10719490B1 · kind B1 · utility

Cited by: 0
References: 1
Claims: 19
Family size: 0

Assignee

Inventor

Key dates

Filing date: Dec 19, 2019
Grant date: Jul 21, 2020
Priority date
Expiry date: Dec 19, 2039

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F17/18
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A system, method, and computer-readable medium for generating synthetic data are described. Improved data models for databases may be achieved by improving the quality of the synthetic data used for modeling those databases and for checking the authenticity of existing numerical data. According to some aspects, these and other benefits may be achieved by using numeric distribution information in a schema describing one or more numeric fields and, based on that schema, generating distribution-appropriate numerical data. Another schema may also be used to generate a second set of numerical data having a different distribution, one that is not expected for the one or more numeric fields. Actual data may be compared against the generated datasets. When the actual data is determined to be statistically similar to the second numerical dataset, an alert may be generated. One benefit is an efficient approach to finding potentially fraudulent datasets.
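
The comparison the abstract describes can be illustrated with a minimal sketch. It is not the patented implementation, and every specific in it is an assumption made for illustration: the schema keys, the function names, the choice of Benford's law as the expected distribution, a uniform distribution as the unexpected one, and total variation distance over leading digits as the measure of statistical similarity. The abstract names none of these.

```python
import random

# Hypothetical schemas: keys and field names are illustrative assumptions.
EXPECTED_SCHEMA = {"field": "amount", "distribution": "benford", "decades": 3}
UNEXPECTED_SCHEMA = {"field": "amount", "distribution": "uniform",
                     "low": 1.0, "high": 1000.0}


def generate(schema, n, rng):
    """Draw n synthetic values for one numeric field as the schema describes."""
    dist = schema["distribution"]
    if dist == "benford":
        # 10**U with U uniform over a whole number of decades yields
        # leading digits that follow Benford's law.
        return [10 ** rng.uniform(0, schema["decades"]) for _ in range(n)]
    if dist == "uniform":
        return [rng.uniform(schema["low"], schema["high"]) for _ in range(n)]
    raise ValueError(f"unsupported distribution: {dist}")


def leading_digit(x):
    """Return the first significant digit (1-9) of a nonzero number."""
    return int(f"{abs(x):.15e}"[0])


def digit_freqs(values):
    """Relative frequency of each leading digit 1..9, ignoring zeros."""
    digits = [leading_digit(v) for v in values if v != 0]
    if not digits:
        raise ValueError("no nonzero values to analyse")
    return [digits.count(d) / len(digits) for d in range(1, 10)]


def distance(a, b):
    """Total variation distance between two leading-digit distributions."""
    return 0.5 * sum(abs(p - q) for p, q in zip(digit_freqs(a), digit_freqs(b)))


def check(actual, expected_schema, unexpected_schema, n=5000, seed=0):
    """Compare actual data with both synthetic datasets; flag an alert when
    it is closer to the dataset with the unexpected distribution."""
    rng = random.Random(seed)
    expected = generate(expected_schema, n, rng)
    unexpected = generate(unexpected_schema, n, rng)
    d_expected = distance(actual, expected)
    d_unexpected = distance(actual, unexpected)
    return {
        "distance_to_expected": d_expected,
        "distance_to_unexpected": d_unexpected,
        "alert": d_unexpected < d_expected,
    }
```

Calling `check(values, EXPECTED_SCHEMA, UNEXPECTED_SCHEMA)` on a list of recorded amounts returns both distances and an `alert` flag. A production system would more likely use a formal statistical test with a significance threshold than a nearest-distribution rule; the rule is used here only to keep the sketch short.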

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.