Patent · US Active

System and method for indexing streams containing unstructured text data

US9262511B2 · kind B2 · utility

2Cited by
15References
29Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMar 15, 2013
Grant dateFeb 16, 2016
Priority date
Expiry dateMar 28, 2033

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/24568
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A system, method and computer readable medium for indexing streaming data. Data may be received from distributed devices connected via a network. Data elements may be stored and allocated to data blocks and events of the block stores. Non-text data may be converted into a text representation. The data may be split into terms, and term frequencies of each term within each of the event may be calculated. Block-level term frequency statics may be calculated based on the term frequencies. Tree index structures, such as the Y-tree index, may be generated based on the block-level term frequency data. The Y-tree index structures may use the terms as keys and pointers to the corresponding data blocks and block-level term frequency data. A search query may be performed over the tree index structures.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.