Patent · US Expired

Method and apparatus for discriminating between documents in batch scanned document files

US6996276B2 · kind B2 · utility

8Cited by
8References
16Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMay 7, 2004
Grant dateFeb 7, 2006
Priority date
Expiry dateSep 15, 2024

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06V30/40
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Discriminating between documents scanned in a batch scanning process is achieved based on various analyses of the constituent document pages. The data provided by the various analyses are compared with each other to determine whether successive pages belong to the same document. Scanned documents result in a page sequence that is analyzed to extract one or more feature attributes for each page. The feature attributes are provided to a feature comparison process in order to assess the similarity of successive pages. If a sufficient likelihood of similarity is found, the compared pages are deemed to be from the same document; otherwise, they are deemed to be from different documents, indicating the existence of a document break. Based on the document breaks, separate scan files may be established. In this manner, the present invention represents eliminates the requirement of user intervention.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.