Patent · US Active

Method and system for fast, generic, online and offline, multi-source text analysis and visualization

US7792816B2 · kind B2 · utility

18Cited by
119References
8Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJan 31, 2008
Grant dateSep 7, 2010
Priority date
Expiry dateJan 28, 2029

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/358
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Methods and systems for text data analysis and visualization enable a user to specify a set of text data sources and visualize the content of the text data sources in an overview of salient features in the form of a network of words. A user may focus on one or more words to provide a visualization of connections specific to the focused word(s). The visualization may include clustering of relevant concepts within the network of words. Upon selection of a word, the context thereof, e.g., links to articles where the word appears, may be provided to the user. Analyzing may include textual statistical correlation models for assigning weights to words and links between words. Displaying the network of words may include a force-based network layout algorithm. Extracting clusters for display may include identifying “communities of words” as if the network of words was a social network.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.