Patent · US Active

Integrating and extracting topics from content of heterogeneous sources

US9176969B2 · kind B2 · utility

5Cited by
1References
8Claims
0Family size

Assignee

Inventors

Key dates

Filing dateAug 29, 2013
Grant dateNov 3, 2015
Priority date
Expiry dateFeb 25, 2034

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/951
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Examples relate to integrating and extracting topics from content of heterogeneous sources. Observed words are identified in documents, which are received from the heterogeneous sources. Next, document metadata and source metadata are obtained from the heterogeneous sources. The document metadata is used to calculate word topic probabilities for the observed words, and the source metadata is used to calculate source topic probabilities for the observed words. A latent topic is then determined for one of the documents based on the observed words, the word topic probabilities, and the source topic probabilities.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.