Optimizing large scale data analysis
US11768752B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Aug 21, 2019 |
| Grant date | Sep 26, 2023 |
| Priority date | — |
| Expiry date | Jun 3, 2040 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06Q30/0269
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, that facilitate resource and space efficient analysis of large scale datasets. Methods include obtaining activity data for objects in a dataset. For each data item in the dataset, a hashed parameter having a binary representation is generated using an identifier for the object. A register is identified from among a set of registers based on the hashed parameter. A determination is made that the hashed parameter for the object contributes to an aggregation amount that specifies a number of occurrences of the object in the dataset. Based on this determination, an aggregation amount stored in the register is updated. Based on aggregation amounts stored in the set of registers, a reporting output is generated that provides an aggregate distribution of the objects in the dataset based on the activity data for the objects.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.