Patent · US Active

Real-time identification of data candidates for classification based compression

US9588980B2 · kind B2 · utility

1Cited by
28References
7Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJun 22, 2015
Grant dateMar 7, 2017
Priority date
Expiry dateJun 22, 2035

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/285
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Identification of data candidates for data processing is performed in real time by a processor device in a distributed computing environment. Data candidates are sampled for performing a classification-based compression upon the data candidates. A heuristic is computed on a randomly selected data sample from the data candidate, the heuristic computed by, for each one of the data classes, calculating an expected number of characters to be in a data class, calculating an expected number of characters that will not belong to a predefined set of the data classes, and calculating an actual number of the characters for each of the data classes and the non-classifiable data.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.