Patent · US Expired

Method and apparatus for discriminating between documents in batch scanned document files

US6735335B1 · kind B1 · utility

20Cited by
7References
24Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMay 30, 2000
Grant dateMay 11, 2004
Priority date
Expiry dateOct 31, 2021

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06V30/40
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Discriminating between documents scanned in a batch scanning process is achieved based on various analyses of the constituent document pages. The data provided by the various analyses are compared with each other to determine whether successive pages belong to the same document. Scanned documents result in a page sequence. The page sequence is then analyzed to extract one or more features attributes for each page. The feature attributes are provided to a feature comparison process in order to assess the similarity of successive pages. If a sufficient likelihood of similarity is found, then the compared pages are deemed to be from the same document; otherwise, they are deemed to be from different documents, indicating the existence of a document break. Through the display of the page sequence, a user may optionally modify the location of one or more document breaks. Based on the document breaks, separate scan files may be established. In this manner, the present invention represents eliminates the requirement of user intervention.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.