System and method for indexing streams containing unstructured text data
US9262511B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Mar 15, 2013 |
| Grant date | Feb 16, 2016 |
| Priority date | — |
| Expiry date | Mar 28, 2033 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06F16/24568
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A system, method, and computer-readable medium for indexing streaming data. Data may be received from distributed devices connected via a network. Data elements may be stored and allocated to data blocks and events of the block stores. Non-text data may be converted into a text representation. The data may be split into terms, and the term frequency of each term within each of the events may be calculated. Block-level term frequency statistics may be calculated based on the term frequencies. Tree index structures, such as the Y-tree index, may be generated based on the block-level term frequency data. The Y-tree index structures may use the terms as keys and may hold pointers to the corresponding data blocks and block-level term frequency data. A search query may be performed over the tree index structures.
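The pipeline in the abstract (split events into terms, count per-event term frequencies, aggregate them per block, index terms to blocks, then query) can be illustrated with a minimal Python sketch. This is not the patented implementation. A plain dictionary stands in for the Y-tree index, and the tokenization rule, the choice of block-level statistics (total occurrences and number of events containing the term), the ranking rule, and all function names are assumptions made for illustration only.

```python
import re
from collections import Counter, defaultdict


def tokenize(text):
    """Split text into lowercase terms (assumed rule: runs of letters or digits)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def event_term_frequencies(event_text):
    """Count how often each term occurs within a single event."""
    return Counter(tokenize(event_text))


def block_term_stats(events):
    """Aggregate per-event term frequencies into block-level statistics.

    Returns term -> (total occurrences in the block,
                     number of events in the block containing the term).
    The choice of these two statistics is an assumption.
    """
    stats = {}
    for event in events:
        for term, tf in event_term_frequencies(event).items():
            total, n_events = stats.get(term, (0, 0))
            stats[term] = (total + tf, n_events + 1)
    return stats


def build_index(blocks):
    """Build term -> {block_id: block-level stats}.

    `blocks` maps a block id to its list of event strings. A dict is used
    here as a stand-in for the tree index (e.g. a Y-tree) named in the abstract.
    """
    index = defaultdict(dict)
    for block_id, events in blocks.items():
        for term, stat in block_term_stats(events).items():
            index[term][block_id] = stat
    return dict(index)


def search(index, query):
    """Return ids of blocks containing every query term.

    Blocks are ranked by summed total term frequency, highest first
    (an assumed ranking rule), with ties broken by block id.
    """
    terms = tokenize(query)
    if not terms:
        return []
    candidates = None
    for term in terms:
        ids = set(index.get(term, {}))
        candidates = ids if candidates is None else candidates & ids

    def score(block_id):
        return sum(index[t][block_id][0] for t in terms)

    return sorted(candidates, key=lambda b: (-score(b), b))
```

With two hypothetical blocks, `{"b1": ["error disk full", "disk error on node 3"], "b2": ["login ok", "disk check ok"]}`, the query `"disk"` returns both blocks with `b1` first, and `"disk ok"` returns only `b2`. Keeping statistics at block level means a query can rule out whole blocks without scanning their events.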
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.