Patent · US Expired

Method and apparatus for image based document processing

US5943443A · kind A · utility

226Cited by
14References
8Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJun 23, 1997
Grant dateAug 24, 1999
Priority date
Expiry dateJun 23, 2017

Classification

  • Technology area (CPC Y)Emerging Cross-Sectional Technologies
  • CPC primaryY10S707/99936
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

The present invention provides a document processing apparatus, document processing method and a storage medium for storing thereof on purpose to offer document filing in which document can be registered with a little computation cost and with high speed, and retrieval can be performed with little oversight. In the document processing apparatus, a similar character classifying element classifies characters in a document image into similar character categories in advance and stores the classified categories together with their representative image features. When the document image is registered, a pseudo character recognizing element executes, without identifying each character in the text region, classification into character categories based on the image features less than those used in the ordinary character recognition and stores the category strings generated by identifying each character with the inputted image. In retrieval, a retrieval executing element converts each character in the retrieval keyword into nearest category, and retrieves a document including the converted category string as a part as a result of retrieval.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.