Patent · US Active

Page classifier engine

US8392816B2 · kind B2 · utility

Cited by: 5
References: 21
Claims: 15
Family size: 0

Assignee

Inventors

Key dates

Filing date: Dec 3, 2007
Grant date: Mar 5, 2013
Priority date
Expiry date: Nov 11, 2030

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06V30/416
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Embodiments of the present invention relate to classifying pages of an electronic document, such as a scanned book page. OCR software is applied to the contents of the electronic document, revealing semantic information about that content. Software-based features are applied to the semantic information to determine the page type. Page types may include table of contents (TOC), table of figures (TOF), bibliography, index, or other types of pages commonly found in a book, magazine, or other publication. Once determined, the page type is stored and used by other software engines.
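
The flow described in the abstract (OCR text in, features computed, page type stored for later use) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the `OcrPage` container, the heading keywords, the three regular-expression features, and the 0.6 threshold are all hypothetical, and the OCR step itself is assumed to have already produced lines of text.

```python
import re
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class OcrPage:
    """Hypothetical stand-in for OCR output: a page number plus text lines."""
    number: int
    lines: List[str]


def fraction_matching(lines: List[str], pattern: str) -> float:
    """Return the share of non-empty lines that match a regular expression."""
    non_empty = [ln for ln in lines if ln.strip()]
    if not non_empty:
        return 0.0
    hits = sum(1 for ln in non_empty if re.search(pattern, ln))
    return hits / len(non_empty)


def extract_features(page: OcrPage) -> Dict[str, object]:
    """Compute a few illustrative features (assumed, not from the patent)."""
    heading = next((ln.strip().lower() for ln in page.lines if ln.strip()), "")
    return {
        "heading": heading,
        # Lines ending in a number, e.g. "Chapter 1 .... 3"
        "trailing_number": fraction_matching(page.lines, r"\d+\s*$"),
        # Lines containing a year-like token, e.g. "(1999)"
        "year": fraction_matching(page.lines, r"\b(1[5-9]|20)\d{2}\b"),
        # Lines shaped like "term, 12, 45-47"
        "term_pages": fraction_matching(
            page.lines, r"^[^,]+,\s*\d+([-–,]\s*\d+)*\s*$"
        ),
    }


def classify(page: OcrPage, threshold: float = 0.6) -> str:
    """Map features to a page type using simple hand-written rules."""
    f = extract_features(page)
    heading = f["heading"]
    # Heading keywords are checked first because they are the strongest cue.
    if "contents" in heading:
        return "toc"
    if "figures" in heading or "illustrations" in heading:
        return "tof"
    if "bibliography" in heading or "references" in heading:
        return "bibliography"
    if heading == "index":
        return "index"
    # Fall back to line-shape features. Index entries also end in numbers,
    # so the more specific index pattern is tested before the TOC pattern.
    if f["term_pages"] >= threshold:
        return "index"
    if f["year"] >= threshold:
        return "bibliography"
    if f["trailing_number"] >= threshold:
        return "toc"
    return "other"


def classify_document(pages: List[OcrPage]) -> Dict[int, str]:
    """Store the determined type per page so other components can read it."""
    return {page.number: classify(page) for page in pages}


pages = [
    OcrPage(1, ["Contents", "Chapter 1 .... 3", "Chapter 2 .... 17"]),
    OcrPage(2, ["apple, 12, 45-47", "banana, 3", "cherry, 101"]),
    OcrPage(3, ["Smith, J. (1999). Page layout analysis.",
                "Doe, A. (2004). OCR methods."]),
    OcrPage(4, ["It was a dark and stormy night.",
                "The rain fell in torrents."]),
]
page_types = classify_document(pages)
print(page_types)
```

The returned dictionary plays the role of the stored page type that downstream engines would consult. A production system would likely replace the hand-written rules with a trained classifier over richer features.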

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.