Patent · US · Expired

System and method for performing efficient document scoring and clustering

US7610313B2 · kind B2 · utility

27 Cited by
38 References
22 Claims
0 Family size

Assignee

Inventors

Key dates

Filing date: Jul 25, 2003
Grant date: Oct 27, 2009
Priority date
Expiry date: Nov 9, 2024

Classification

  • Technology area (CPC Y): Emerging Cross-Sectional Technologies
  • CPC primary: Y10S707/99937
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A system and method for providing efficient document scoring of concepts within a document set is described. A frequency of occurrence of at least one concept within a document retrieved from the document set is determined. A concept weight is analyzed reflecting a specificity of meaning for the at least one concept within the document. A structural weight is analyzed reflecting a degree of significance based on structural location within the document for the at least one concept. A corpus weight is analyzed inversely weighing a reference count of occurrences for the at least one concept within the document. A score associated with the at least one concept is evaluated as a function of the frequency, concept weight, structural weight, and corpus weight.
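
The scoring described in the abstract can be illustrated with a minimal sketch. The abstract says only that the score is "a function of" the four factors, so the multiplicative form, the logarithmic damping in the corpus weight, and the names score_concept and corpus_weight are all assumptions made here for illustration. They are not taken from the patent's claims.

```python
import math


def corpus_weight(reference_count: int) -> float:
    """Hypothetical corpus weight that shrinks as the reference count grows.

    The abstract only says the weight is inverse to the reference count;
    the 1 / (1 + log(1 + n)) form is an assumption for this sketch.
    """
    if reference_count < 0:
        raise ValueError("reference_count must be non-negative")
    return 1.0 / (1.0 + math.log1p(reference_count))


def score_concept(
    frequency: int,
    concept_weight: float,
    structural_weight: float,
    reference_count: int,
) -> float:
    """Combine the four factors named in the abstract into one score.

    Assumption: the factors are multiplied. The abstract does not
    specify how they are combined.
    """
    if frequency < 0:
        raise ValueError("frequency must be non-negative")
    return (
        frequency
        * concept_weight
        * structural_weight
        * corpus_weight(reference_count)
    )


# Same concept statistics within the document, but different reference counts.
rare = score_concept(frequency=3, concept_weight=0.8,
                     structural_weight=1.5, reference_count=2)
common = score_concept(frequency=3, concept_weight=0.8,
                       structural_weight=1.5, reference_count=500)
print(rare > common)  # True: the higher reference count lowers the score
```

Under these assumptions, a concept that appears in a prominent structural location (higher structural weight) with a low reference count scores higher than an equally frequent concept with a high reference count.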

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.