Real-time identification of data candidates for classification based compression
US10387376B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jan 25, 2017 |
| Grant date | Aug 20, 2019 |
| Priority date | — |
| Expiry date | Feb 19, 2037 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/285
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Identification of data candidates for data processing is performed in real time by a processor device in a computing environment. Data candidates are sampled for performing a classification-based compression upon the data candidates. A heuristic is computed on a randomly selected data sample from the data candidate, the heuristic computed by, for each one of the data classes, calculating an expected number of characters to be in a data class, calculating an expected number of characters that will not belong to a predefined set of the data classes, and calculating an actual number of the characters for each of the data classes and the non-classifiable data.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.