Patent · US Expired

Method for analyzing structure of a treatise type of document image

US6728403B1 · kind B1 · utility

10Cited by
13References
11Claims
0Family size

Assignee

Inventors

Key dates

Filing dateFeb 2, 2000
Grant dateApr 27, 2004
Priority date
Expiry dateFeb 2, 2020

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06V30/416
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method for analyzing structure of a treatise type of document image in order to detect a title, an author and an abstract region and recognize the content in each of the regions is provided. In order to analyze the structure of a treatise type of document, first, the document image divided into a number of regions and the divided regions are classified into text regions and non-text regions according to attributes of the regions. And then, the candidate regions representing an abstract and an introduction is selected, thereafter word regions are extracted from the candidate regions, and an abstract content portion is determined. Thereafter, the title and the author are separated by using the basic form and the type definition representing an arrangement of each of journals. Finally, the content of the separated regions is recognized to generate said table of contents.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.