Patent · US Active

Unsupervised detection and categorization of word clusters in text data

US9563666B2 · kind B2 · utility

Cited by: 1
References: 10
Claims: 17
Family size: 0

Assignee

Inventors

Key dates

Filing date: Nov 9, 2012
Grant date: Feb 7, 2017
Priority date
Expiry date: Nov 22, 2033

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F16/93
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Categorizing data sets obtained from a number of sources includes: determining the frequency of appearance of symbols in a first collection of data sets and in a second collection of data sets; determining the most significant symbols for the second collection based on the frequencies of appearance in the first and second collections; grouping the most significant symbols according to their appearance in the same data set; and ranking the data sets in relation to the symbol groups according to a ranking scheme. Related methods, devices, and/or computer program products are described.
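
The four steps named in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: the abstract does not specify a significance measure, a grouping rule, or a ranking scheme, so the smoothed frequency ratio, the co-occurrence grouping via union-find, and the match-count ranking below are all assumptions chosen for illustration. The function names, the `top_n` and `smoothing` parameters, and the sample data are hypothetical.

```python
from collections import Counter


def symbol_frequencies(data_sets):
    """Relative frequency of each symbol across a collection of data sets."""
    counts = Counter(sym for ds in data_sets for sym in ds)
    total = sum(counts.values())
    return {sym: n / total for sym, n in counts.items()} if total else {}


def most_significant(first, second, top_n=4, smoothing=1e-6):
    """Symbols over-represented in `second` relative to `first`.

    Assumption: significance = freq_second / (freq_first + smoothing).
    Ties are broken alphabetically so the result is deterministic.
    """
    f1 = symbol_frequencies(first)
    f2 = symbol_frequencies(second)
    score = {s: f2[s] / (f1.get(s, 0.0) + smoothing) for s in f2}
    return sorted(score, key=lambda s: (-score[s], s))[:top_n]


def group_symbols(significant, data_sets):
    """Assumption: symbols that appear in the same data set share a group."""
    parent = {s: s for s in significant}

    def find(s):
        # Union-find lookup with path halving.
        while parent[s] != s:
            parent[s] = parent[parent[s]]
            s = parent[s]
        return s

    for ds in data_sets:
        present = [s for s in significant if s in ds]
        for a, b in zip(present, present[1:]):
            parent[find(a)] = find(b)

    groups = {}
    for s in significant:
        groups.setdefault(find(s), set()).add(s)
    return sorted(groups.values(), key=lambda g: sorted(g))


def rank_data_sets(group, data_sets):
    """Assumption: rank data sets by how many of the group's symbols they contain.

    Returns indices of matching data sets, best match first.
    """
    scored = [(sum(1 for s in group if s in ds), i) for i, ds in enumerate(data_sets)]
    return [i for n, i in sorted(scored, key=lambda t: (-t[0], t[1])) if n > 0]


# Hypothetical sample data: a background collection and a newer collection.
first = [["the", "cat", "sat"], ["the", "dog", "ran"], ["the", "market", "fell"]]
second = [
    ["the", "storm", "flood"],
    ["storm", "flood", "rain"],
    ["the", "election", "vote"],
    ["election", "vote", "poll"],
]

significant = most_significant(first, second)
groups = group_symbols(significant, second)
rankings = [rank_data_sets(g, second) for g in groups]
```

In this sketch the common word "the" scores low because it is frequent in both collections, while words that are new to the second collection score high and end up clustered by the data sets they share.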

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.