System and method for format drift and format anomaly detection
US12373324B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Feb 2, 2022 |
| Grant date | Jul 29, 2025 |
| Priority date | — |
| Expiry date | Aug 1, 2042 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06Q10/1053
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A computerized method for detection of format drift and format anomalies is described. A format representation for each data point of a first data sample is extracted. Transformations of each format representation is conducted, resulting in a first plurality of count values (reference) and a second plurality of count values. Each count value identifies a number of occurrences of a transformed format representation within that data sample. Thereafter, a first probability distribution for the first plurality of count values and a second probability distribution for the second plurality of count values are computed. Analytics using the first and probability distributions are conducted to produce a first metric. A format drift is determined based on an evaluation of the first metric to a second metric operating as a threshold metric. Format anomalies are detected based on analytics of hashed format representation and determination of infrequent usage of a particular format representation.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.