Patent · US Active

Real-time identification of data candidates for classification based compression

US10387376B2 · kind B2 · utility

0Cited by
28References
14Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJan 25, 2017
Grant dateAug 20, 2019
Priority date
Expiry dateFeb 19, 2037

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/285
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Identification of data candidates for data processing is performed in real time by a processor device in a computing environment. Data candidates are sampled for performing a classification-based compression upon the data candidates. A heuristic is computed on a randomly selected data sample from the data candidate, the heuristic computed by, for each one of the data classes, calculating an expected number of characters to be in a data class, calculating an expected number of characters that will not belong to a predefined set of the data classes, and calculating an actual number of the characters for each of the data classes and the non-classifiable data.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.