Methods and apparatus for detecting and displaying similarities in large data sets
US5953006A · kind A · utility
Assignee
Inventors
Key dates
| Filing date | Mar 18, 1992 |
| Grant date | Sep 14, 1999 |
| Priority date | — |
| Expiry date | Mar 18, 2012 |
Classification
- Technology area (CPC Y)Emerging Cross-Sectional Technologies
- CPC primaryY10S707/99936
- WIPO fieldAudio-visual technology
- WIPO sectorElectrical engineering
Abstract
Interactive Methods and apparatus for studying similarities of values in very large data sets. The methods and apparatus employ a dotplot in an interactive graphical user interface to make the relationship between the similarities and the data set visible. A variety of filtering, weighting, and compression techniques make it possible to employ the dot plot with sequences of more than 10,000 tokens and to interactively magnify the dot plot, change weighting and display quantization, and view the underlying data. Also disclosed is a technique which is employed in the apparatus for identifying long sequences of similar tokens. The apparatus is used in the study of large bodies of text and code.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.