Synthesizing union tables from the web
US9720896B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Dec 30, 2013 |
| Grant date | Aug 1, 2017 |
| Priority date | — |
| Expiry date | Jul 22, 2035 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F40/143
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Systems and techniques are provided for generating a union table with from stitchable tables. Tables may be extracted from web pages to obtain extracted tables. Stitchable tables may be determined from the extracted tables. Hidden attributes for the stitchable tables may be extracted from the web pages from which the stitchable tables were extracted using segmentation of text for contextual data from the web pages into segment sequences, and alignment of the segment sequences. Iterative pairwise alignment may be used to align the segment sequences and obtain aligned segments. The stitchable tables may be joined into a union table. Hidden attributes from the aligned segments may be added to the union table. Headers for the hidden attributes in the union table may be labeled using a database of entities and class labels.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.