Creating NoSQL database index for semi-structured data
US9953102B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jan 16, 2015 |
| Grant date | Apr 24, 2018 |
| Priority date | — |
| Expiry date | Feb 20, 2036 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06F16/319
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Semi-structured source data is preprocessed to obtain text partitions, which are stored in a data table with a first combined primary key that includes a structure thread primary key and a sequence value primary key. The structure thread primary key identifies a structure thread that is segmented into several consecutive intervals according to a determined or predetermined sequence. An inverted index table, created for the preprocessed text partitions, has a second combined primary key that includes the structure thread primary key and a keyword primary key. For each combination of values in the second combined primary key, the sequence IDs of the related text partitions are recorded as index values of the inverted index table. Index values that have the same keyword primary key value but different structure thread primary key values are located in different rows of the inverted index table. The present techniques improve the query efficiency of the database index and facilitate updating.
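The two-table layout described in the abstract can be illustrated with a minimal in-memory sketch. It is not the patented implementation: plain Python dictionaries stand in for the NoSQL tables, the function names `build_tables` and `search` are hypothetical, the thread key strings are made up, and splitting text into keywords with a simple regular expression is an assumption.

```python
import re
from collections import defaultdict


def build_tables(partitions):
    """Build a data table and an inverted index table.

    partitions: iterable of (thread_key, sequence_id, text) tuples,
    i.e. already-preprocessed text partitions (hypothetical input shape).
    """
    # Data table: combined key (structure thread key, sequence value) -> text.
    data_table = {}
    # Inverted index: combined key (structure thread key, keyword) -> sequence IDs.
    inverted_index = defaultdict(list)

    for thread_key, seq_id, text in partitions:
        data_table[(thread_key, seq_id)] = text
        # Assumed tokenization: lowercase word characters, one entry per partition.
        for keyword in set(re.findall(r"\w+", text.lower())):
            inverted_index[(thread_key, keyword)].append(seq_id)

    return data_table, dict(inverted_index)


def search(data_table, inverted_index, thread_key, keyword):
    """Return the text partitions in one structure thread that contain keyword."""
    seq_ids = inverted_index.get((thread_key, keyword.lower()), [])
    return [data_table[(thread_key, seq_id)] for seq_id in seq_ids]


# Hypothetical sample data: two structure threads sharing the keyword "index".
sample = [
    ("thread-A", 1, "index update"),
    ("thread-A", 2, "query index"),
    ("thread-B", 1, "index rebuild"),
]
data_table, inverted_index = build_tables(sample)
```

Because the structure thread key is part of the index key, the keyword `index` occupies one row for `thread-A` and a separate row for `thread-B`, so a lookup or update scoped to one thread touches only that thread's row.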
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.