Method and apparatus for discriminating between documents in batch scanned document files
US6996276B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | May 7, 2004 |
| Grant date | Feb 7, 2006 |
| Priority date | — |
| Expiry date | Sep 15, 2024 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06V30/40
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Discriminating between documents scanned in a batch scanning process is achieved based on various analyses of the constituent document pages. The data provided by the various analyses are compared with each other to determine whether successive pages belong to the same document. Scanned documents result in a page sequence that is analyzed to extract one or more feature attributes for each page. The feature attributes are provided to a feature comparison process in order to assess the similarity of successive pages. If a sufficient likelihood of similarity is found, the compared pages are deemed to be from the same document; otherwise, they are deemed to be from different documents, indicating the existence of a document break. Based on the document breaks, separate scan files may be established. In this manner, the present invention eliminates the requirement of user intervention.
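The comparison of successive pages described in the abstract can be illustrated with a minimal sketch. It is not the patented method. It assumes each page has already been reduced to a numeric feature vector, and it uses cosine similarity with a fixed threshold as a stand-in for the feature comparison process. The function names, the vector representation, and the threshold value of 0.8 are all hypothetical choices made for illustration.

```python
from math import sqrt
from typing import List, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def find_document_breaks(
    page_features: Sequence[Sequence[float]], threshold: float = 0.8
) -> List[int]:
    """Return indices i where page i is judged to start a new document.

    Assumption: a similarity below `threshold` between page i-1 and page i
    is treated as a document break. The threshold is illustrative only.
    """
    breaks = []
    for i in range(1, len(page_features)):
        if cosine_similarity(page_features[i - 1], page_features[i]) < threshold:
            breaks.append(i)
    return breaks


def split_into_documents(
    page_features: Sequence[Sequence[float]], threshold: float = 0.8
) -> List[List[int]]:
    """Group page indices into documents using the detected breaks."""
    if not page_features:
        return []
    bounds = [0] + find_document_breaks(page_features, threshold) + [len(page_features)]
    return [list(range(start, end)) for start, end in zip(bounds, bounds[1:])]


# Hypothetical feature vectors for a four-page scan batch.
pages = [
    [1.0, 0.0, 0.0],
    [0.9, 0.1, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.95, 0.05],
]
documents = split_into_documents(pages)
```

Each inner list in `documents` holds the page indices that would go into one scan file. A real system would likely combine several feature attributes per page rather than rely on a single vector and a single threshold.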
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.