Patent · US Active

Cluster labeling system for documents comprising unstructured text data

US9672279B1 · kind B1 · utility

15Cited by
9References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateSep 30, 2014
Grant dateJun 6, 2017
Priority date
Expiry dateMay 5, 2035

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06N20/00
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

An apparatus comprises a processing platform configured to implement a cluster labeling system for documents comprising unstructured text data. The cluster labeling system comprises a clustering module and a visualization module. The clustering module implements a topic model generator and is configured to assign each of the documents to one or more of a plurality of clusters based at least in part on one or more topics identified from the unstructured text data using at least one topic model provided by the topic model generator. The visualization module comprises multiple view generators configured to generate respective distinct visualizations of a selected one of the clusters. The multiple view generators include at least a bigram view generator configured to provide a visualization of a plurality of term pairs from the selected cluster, and a summarization view generator configured to provide a visualization of representative term sequences from the selected cluster.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.