Robust wrappers for web extraction
US8762829B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Dec 24, 2008 |
| Grant date | Jun 24, 2014 |
| Priority date | — |
| Expiry date | Mar 12, 2032 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F2221/2145
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A computer-implemented method to determine a robust wrapper includes developing a model indicative of the temporal history of a document, such as a web document written in a markup language. Based on the developed model, robustness characteristics are determined for a plurality of different wrappers representing associated paths to the data item in a representation of the document. Based on a result of the determining operation, a result wrapper of the plurality of wrappers is provided. The result wrapper has a desired robustness characteristic.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.