Methods and apparatus for identifying tables in digital files
US9348848B2 · kind B2 · utility
Assignees
Inventors
Key dates
| Filing date | Apr 26, 2013 |
| Grant date | May 24, 2016 |
| Priority date | — |
| Expiry date | Nov 29, 2033 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06V30/414
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method for identifying a table in a digital file includes extracting lines from a layout of the digital file, wherein the lines comprise horizontal lines and vertical lines. The method also includes identifying intersected line groups, wherein each intersected line group comprises a horizontal line of the extracted horizontal lines and a vertical line of the extracted vertical lines, the horizontal line and the vertical line intersecting with each other. The method further includes determining whether the number of intersected lines in each intersected line group is larger than a first threshold. If yes, the method further includes identifying an area in which the intersected line groups are located as a table area. If no, the method further includes performing vertical projection on characters in the area, and identifying the area as a table area based on results of the vertical projection.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.