Processing document image including caption region
US8514462B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jun 29, 2011 |
| Grant date | Aug 20, 2013 |
| Priority date | — |
| Expiry date | Oct 26, 2031 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06V30/422
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
An image processing apparatus comprises: a character information acquisition unit configured to acquire character information included in each of a body region and a caption region; an accumulation unit configured to divide the character information acquired from the body region into predetermined set units and to accumulate the character information and position information of the divided set unit in a memory; an anchor term extraction unit configured to extract an anchor term from the character information acquired from the caption region; an anchor term search unit configured to search, based on the character information accumulated in the memory for each set unit, for the set unit including the anchor term extracted; a link information generation unit configured to generate link generation information that associates the set unit found by the anchor term search unit with the object region to which the caption region including the anchor term is appended.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.