Patent · US Active

System and method for providing robust topic identification in social indexes

US8549016B2 · kind B2 · utility

13Cited by
59References
22Claims
0Family size

Assignee

Inventors

Key dates

Filing dateOct 29, 2009
Grant dateOct 1, 2013
Priority date
Expiry dateOct 11, 2030

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/00
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A computer-implemented method for providing robust topic identification in social indexes is described. Electronically-stored articles and one or more indexes are maintained. Each index includes topics that each relate to one or more of the articles. A random sampling and a selective sampling of the articles are both selected. For each topic, characteristic words included in the articles in each of the random sampling and the selective sampling are identified. Frequencies of occurrence of the characteristic words in each of the random sampling and the selective sampling are determined. A ratio of the frequencies of occurrence for the characteristic words included in the random sampling and the selective sampling is identified. Finally, for each topic, a coarse-grained topic model is built, which includes the characteristic words included in the articles relating to the topic and scores assigned to those characteristic words.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.