Apparatus and method for dividing document including table
US6865720B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Mar 23, 2000 |
| Grant date | Mar 8, 2005 |
| Priority date | — |
| Expiry date | Mar 23, 2020 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F40/177
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A table in an HTML document is analyzed to generate cell position data indicating a positional relationship between cells and cell vectors representing characteristics of the cells, and a table type is judged with reference to the cell position data and the cell vectors, and, if the table type is a table describing a table, it is judged whether the data is represented in a column or a row with reference to the cell position data and the cell vectors, and a cut direction of the table is determined, and segments are generated with reference to the table type and the cut direction. If the table type is a table for layout, the cells are clustered with reference to the cell vectors, and the segments are generated with reference to the cell position data and cell cluster information.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.