Patent · US Active

Optimizing differential XML processing by leveraging schema and statistics

US7707491B2 · kind B2 · utility

3Cited by
1References
1Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 6, 2006
Grant dateApr 27, 2010
Priority date
Expiry dateFeb 24, 2029

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F40/143
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Statistical information about instance documents and schema information are used to integrate multiple state transitions that enable sectioning of a structure document, thereby generating an optimum automaton. In integrating state transitions, consecutively matching state transitions are held in the form of an ID list, which is then used to count the number of consecutive state transitions. Furthermore, patterns in the number of occurrences of repetitive elements including nested elements are statistically obtained. Variations of blanks in XML are addressed by using a statistical method. Schema information is used to build an automaton beforehand, thereby initialization overhead of the syntax parsing apparatus is reduced.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.