Patent · US Expired

Sorting image segments into clusters based on a distance measurement

US6562077B2 · kind B2 · utility

98Cited by
64References
23Claims
0Family size

Assignee

Inventors

Key dates

Filing dateNov 14, 1997
Grant dateMay 13, 2003
Priority date
Expiry dateJan 27, 2018

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06V30/414
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A programming interface of document search system enables a user to dynamically specifying features of documents recorded in a corpus of documents. The programming interface provides category and format flexibility for defining different genre of documents. The document search system initially segments document images into one or more layout objects. Each layout object identifies a structural element in a document such as text blocks, graphics, or halftones. Subsequently, the document search system computes a set of attributes for each of the identified layout objects. The set of attributes are used to describe the layout structure of a page image of a document in terms of the spatial relations that layout objects have to frames of reference that are defined by other layout objects. Using the set of attributes a user defines features of a document with the programming interface. After receiving a feature or attribute and a set of document images selected by a user, the system forms a set of image segments by identifying those layout objects in the set of document images that make up the selected feature or attribute. The system then sorts the set of image segments into meaningful g…

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.