Patent · US Active

System and method for investigating large amounts of data

US9208159B2 · kind B2 · utility

Cited by: 72
References: 1
Claims: 27
Family size: 0

Assignee

Inventors

Key dates

Filing date: Aug 4, 2014
Grant date: Dec 8, 2015
Priority date:
Expiry date: Aug 4, 2034

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F17/00
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A data analysis system is proposed for providing fine-grained low latency access to high volume input data from possibly multiple heterogeneous input data sources. The input data is parsed, optionally transformed, indexed, and stored in a horizontally-scalable key-value data repository where it may be accessed using low latency searches. The input data may be compressed into blocks before being stored to minimize storage requirements. The results of searches present input data in its original form. The input data may include access logs, call data records (CDRs), e-mail messages, etc. The system allows a data analyst to efficiently identify information of interest in a very large dynamic data set up to multiple petabytes in size. Once information of interest has been identified, that subset of the large data set can be imported into a dedicated or specialized data analysis system for an additional in-depth investigation and contextual analysis.
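
The ingest-and-search flow described in the abstract (parse, index, compress into blocks, store in a key-value repository, return matches in original form) can be illustrated with a minimal sketch. This is not the patented implementation: the class name `TinyStore`, the whitespace tokenizer, the block size, the `block:<n>` key scheme, and the use of an in-memory dict in place of a horizontally scalable repository are all assumptions made for illustration. Records are assumed to be single-line strings.

```python
import zlib
from collections import defaultdict


class TinyStore:
    """Toy stand-in for the pipeline in the abstract (hypothetical design)."""

    def __init__(self, block_size=2):
        self.block_size = block_size      # records per compressed block (assumed)
        self.kv = {}                      # dict standing in for the key-value repository
        self.index = defaultdict(set)     # token -> {(block_id, position in block)}
        self._pending = []                # records not yet compressed into a block
        self._next_block = 0

    def ingest(self, record):
        """Index a single-line record and buffer it for block compression."""
        position = len(self._pending)
        for token in record.lower().split():   # naive whitespace "parsing"
            self.index[token].add((self._next_block, position))
        self._pending.append(record)
        if len(self._pending) >= self.block_size:
            self.flush()

    def flush(self):
        """Compress buffered records into one block and store it under a key."""
        if not self._pending:
            return
        payload = "\n".join(self._pending).encode("utf-8")
        self.kv[f"block:{self._next_block}"] = zlib.compress(payload)
        self._pending = []
        self._next_block += 1

    def search(self, token):
        """Return matching records in their original form, in ingest order."""
        self.flush()  # make sure buffered records are searchable
        hits = []
        for block_id, position in sorted(self.index.get(token.lower(), ())):
            block = zlib.decompress(self.kv[f"block:{block_id}"]).decode("utf-8")
            hits.append(block.split("\n")[position])
        return hits


store = TinyStore(block_size=2)
store.ingest("10.0.0.1 GET /index.html 200")
store.ingest("10.0.0.2 GET /login 401")
store.ingest("10.0.0.1 POST /login 200")
print(store.search("10.0.0.1"))
```

In this sketch the index maps each token to a block identifier and a position inside the block, so a search only decompresses blocks that contain a hit. A production system would presumably cache decompressed blocks and shard both the index and the blocks across nodes; those details are outside what the abstract states.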

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.