Patent · US Active

Horizon histogram optimizations

US8433702B1 · kind B1 · utility

109Cited by
5References
33Claims
0Family size

Assignee

Inventors

Key dates

Filing dateSep 28, 2011
Grant dateApr 30, 2013
Priority date
Expiry dateSep 28, 2031

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/903
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Values that occur above a threshold frequency for certain characteristic(s) of a data set are identified. A limited number of count buckets are allocated based on the threshold. Buckets store proxy counts for identifying candidate sets of values rather than actual counts. The data set is divided and each portion is analyzed separately, by iterating through each item in that portion. During each iteration, depending on an item's value(s), a bucket is incremented, all buckets are decremented, or a bucket is assigned or reassigned to count different value(s). A candidate set of values and associated counts is selected for a portion based on the buckets. The candidate sets for each portion are merged and, in some embodiments, filtered based on the associated counts. Actual frequencies are then determined for the values that remain in the merged candidate set.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.