Patent · US Expired

Methods and apparatus for detecting and displaying similarities in large data sets

US5953006A · kind A · utility

22Cited by
0References
15Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMar 18, 1992
Grant dateSep 14, 1999
Priority date
Expiry dateMar 18, 2012

Classification

  • Technology area (CPC Y)Emerging Cross-Sectional Technologies
  • CPC primaryY10S707/99936
  • WIPO fieldAudio-visual technology
  • WIPO sectorElectrical engineering

Abstract

Interactive Methods and apparatus for studying similarities of values in very large data sets. The methods and apparatus employ a dotplot in an interactive graphical user interface to make the relationship between the similarities and the data set visible. A variety of filtering, weighting, and compression techniques make it possible to employ the dot plot with sequences of more than 10,000 tokens and to interactively magnify the dot plot, change weighting and display quantization, and view the underlying data. Also disclosed is a technique which is employed in the apparatus for identifying long sequences of similar tokens. The apparatus is used in the study of large bodies of text and code.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.