Parsing a markup language document
US8250464B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jun 18, 2008 |
| Grant date | Aug 21, 2012 |
| Priority date | — |
| Expiry date | Jan 27, 2031 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F40/221
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method and system for parsing a markup language document are disclosed in the invention. The method comprises: pre-splitting a body of the markup language document into plurality parts; scanning each of the plurality parts, wherein while each of the parts is scanned, the scanning of the part is stopped only when a specific mark is found, and then a stop point at which the scanning is stopped is recorded; splitting the body of the markup language document into a plurality of fragments using the respective stop points; parsing the plurality of fragments in parallel and producing parsing results for the respective fragments; and combining the parsing results for the respective fragments to form a parsing result for the markup language document. A parsing method that supports namespace is also provided.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.