Patent · US Expired

Search and retrieval using document decomposition

US6397213B1 · kind B1 · utility

Cited by: 135
References: 5
Claims: 27
Family size: 0

Assignee

Inventors

Key dates

Filing date: May 12, 1999
Grant date: May 28, 2002
Priority date
Expiry date: May 12, 2019

Classification

  • Technology area (CPC Y): Emerging Cross-Sectional Technologies
  • CPC primary: Y10S707/99935
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Document query and search techniques in which documents to be searched are “decomposed” into “zones,” with each zone representing a grouping of text, a graphical image, or a combination thereof. The zones are defined within, and associated with, a document page. One or more zones in the documents are selected for annotation with text (e.g., keywords), image features, or a combination of both. Document query and search are based on a combination of text annotations and image features. In one implementation for operating a document retrieval system, an unindexed document (also referred to as a “query” or “search key” document) is captured into electronic form and decomposed into a number of zones. The zones can be segmented into text zones and image zones. Descriptors are formed for at least one of the zones. The descriptors can include text annotations for text zones, and text annotations and image features for image zones. Documents in a document database are searched, based on the formed descriptors for the unindexed document and the descriptors for the documents in the database. At least one document in the database is identified as matching …
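
The flow described in the abstract (decompose into zones, form descriptors, compare against a database) can be illustrated with a minimal Python sketch. This is not the patented method: the abstract does not say how descriptors are compared, so the Jaccard overlap for text annotations, the cosine similarity for image features, the equal weighting, and all names (`Zone`, `Document`, `zone_similarity`, `document_score`, `search`) are assumptions made for illustration only. Zone decomposition itself is assumed to have happened already.

```python
import math
from dataclasses import dataclass


@dataclass
class Zone:
    kind: str                          # "text" or "image"
    keywords: frozenset = frozenset()  # text annotations (hypothetical descriptor)
    features: tuple = ()               # image feature vector, image zones only


@dataclass
class Document:
    doc_id: str
    zones: list


def jaccard(a, b):
    """Overlap of two keyword sets; an assumed text-annotation measure."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def cosine(u, v):
    """Cosine similarity of two feature vectors; an assumed image measure."""
    if not u or not v or len(u) != len(v):
        return 0.0
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def zone_similarity(q, z, text_weight=0.5):
    """Text zones compare annotations; image zones combine annotations and features."""
    if q.kind != z.kind:
        return 0.0
    text_score = jaccard(q.keywords, z.keywords)
    if q.kind == "text":
        return text_score
    return text_weight * text_score + (1 - text_weight) * cosine(q.features, z.features)


def document_score(query, candidate):
    """Average, over query zones, of the best-matching candidate zone."""
    if not query.zones:
        return 0.0
    total = sum(
        max((zone_similarity(q, z) for z in candidate.zones), default=0.0)
        for q in query.zones
    )
    return total / len(query.zones)


def search(query, database, top_k=1):
    """Return the top_k database documents ranked by descriptor similarity."""
    ranked = sorted(database, key=lambda d: document_score(query, d), reverse=True)
    return ranked[:top_k]
```

In this sketch a query document with one text zone and one image zone would be passed to `search` together with a list of already-annotated `Document` objects; the best-scoring entry stands in for the "matching" document the abstract refers to.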

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.