Patent · US Active

Regularized latent semantic indexing for topic modeling

US8533195B2 · kind B2 · utility

3Cited by
5References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJun 27, 2011
Grant dateSep 10, 2013
Priority date
Expiry dateAug 17, 2031

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06V10/95
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Electronic documents are retrieved from a database and/or from a network of servers. The documents are topic modeled in accordance with a Regularized Latent Semantic Indexing approach. The Regularized Latent Semantic Indexing approach may allow an equation involving an approximation of a term-document matrix to be solved in parallel by multiple calculating units. The equation may include terms that are regularized via either l1 norm and/or via l2 norm. The Regularized Latent Semantic Indexing approach may be applied to a set, or a fixed number, of documents such that the set of documents is topic modeled. Alternatively, the Regularized Latent Semantic Indexing approach may be applied to a variable number of documents such that, over time, the variable of number of documents is topic modeled.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.