Image document processing device, image document processing method, program, and storage medium
US8295600B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Dec 7, 2007 |
| Grant date | Oct 23, 2012 |
| Priority date | — |
| Expiry date | Aug 22, 2031 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06V30/10
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
An image document processing device extracts a character sequence image having M number of characters in an image document, divides the image into individual character images, extracts features of the individual character images, and based on the features, selects N (N is an integer more than 1) character images in the order of degree of matching from a font-feature dictionary for storing features of all character images according to fonts, and generates an M×N index matrix for the extracted character sequence. In searching, the device searches an index-information storage section with respect to each search character included in a search keyword in an input search expression, and extracts an image document including an index matrix including the search keyword. This provides an image document processing device and an image document processing method each allowing indexing not requiring user's operation and each allowing highly precise searching without OCR recognition.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.