Patent · US Active

Automatic incremental labeling of document clusters

US9002848B1 · kind B1 · utility

20Cited by
18References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJun 22, 2012
Grant dateApr 7, 2015
Priority date
Expiry dateJun 21, 2033

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/355
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Methods and systems for use in labeling documents within a cluster are provided. One example method includes assembling a set of documents including a first plurality of previously clustered documents and a second plurality of documents. Each of the first plurality of previously clustered documents has at least one label identifying a topic to which content of the document relates. The method includes partitioning documents from the set of documents into multiple clusters, determining if a dominant topic exists within one of the multiple clusters, determining a metric value for one of the multiple clusters based on the number of documents within the one of the multiple clusters having a label identifying the determined dominant topic, and labeling at least documents from the second plurality of documents within the one of the multiple clusters with the label identifying the dominant topic when the metric value exceeds a predetermined threshold.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.