Finding partition boundaries for parallel processing of markup language documents
US9477651B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Sep 29, 2010 |
| Grant date | Oct 25, 2016 |
| Priority date | — |
| Expiry date | Dec 6, 2031 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F40/205
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method, a computer program product and a system identify partition locations within an extended markup language (XML) document without parsing so as to process portions of said document in parallel. The XML document includes sections required to remain continuous. The document is scanned for continuous sections without parsing, and boundaries of the initial partitions are adjusted to reside outside the continuous sections to determine resulting partitions for the document. The resulting partitions may be processed in parallel to provide the document information for storage.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.