Regularized latent semantic indexing for topic modeling
US8533195B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jun 27, 2011 |
| Grant date | Sep 10, 2013 |
| Priority date | — |
| Expiry date | Aug 17, 2031 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06V10/95
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Electronic documents are retrieved from a database and/or from a network of servers. The documents are topic modeled in accordance with a Regularized Latent Semantic Indexing approach. The Regularized Latent Semantic Indexing approach may allow an equation involving an approximation of a term-document matrix to be solved in parallel by multiple calculating units. The equation may include terms that are regularized via either l1 norm and/or via l2 norm. The Regularized Latent Semantic Indexing approach may be applied to a set, or a fixed number, of documents such that the set of documents is topic modeled. Alternatively, the Regularized Latent Semantic Indexing approach may be applied to a variable number of documents such that, over time, the variable of number of documents is topic modeled.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.