Patent · US Active

System, method and apparatus for increasing speed of hierarchical latent Dirichlet allocation model

US8527448B2 · kind B2 · utility

Cited by: 2
References: 0
Claims: 17
Family size: 0

Assignee

Inventors

Key dates

Filing date: Dec 20, 2012
Grant date: Sep 3, 2013
Priority date
Expiry date: Dec 20, 2032

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06N5/01
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Embodiments of the present invention disclose a data processing method including: sending global initial statistical information to each slave node; merging the local statistical information received from each slave node to obtain new global statistical information; if Gibbs sampling performed by a slave node has ended, calculating a probability distribution between a document and a topic and a probability distribution between the topic and a word according to the new global statistical information; establishing a likelihood function of a text set according to the calculated probability distributions, and maximizing the likelihood function to obtain a new hLDA hyper-parameter; and, if the iteration of solving for the hLDA hyper-parameter has converged, calculating and outputting, according to the new hLDA hyper-parameter, the probability distribution between the document and the topic and the probability distribution between the topic and the word.
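
The merge and distribution steps of the abstract can be illustrated with a minimal sketch. It is not the patented implementation. The abstract does not say how local statistics are merged, so the sketch assumes a delta-style merge: each slave returns its own updated copy of the topic-word counts, and the master adds each slave's change to the global counts. It also assumes the standard smoothed estimate for the topic-word distribution, and it flattens the hLDA topic hierarchy into plain row indices. The names merge_counts, topic_word_distribution and eta are hypothetical.

```python
def merge_counts(global_counts, local_counts_per_slave):
    """Merge per-slave topic-word counts into new global counts.

    Assumption (not stated in the abstract): every slave started from
    global_counts and returns its own updated copy, so the master adds
    each slave's change (local - global) to the global counts.
    """
    merged = [row[:] for row in global_counts]
    for local in local_counts_per_slave:
        for k, row in enumerate(local):
            for w, count in enumerate(row):
                merged[k][w] += count - global_counts[k][w]
    return merged


def topic_word_distribution(counts, eta):
    """Smoothed estimate phi[k][w] = (n_kw + eta) / (n_k + V * eta).

    eta is an assumed symmetric Dirichlet hyper-parameter; V is the
    vocabulary size (number of columns).
    """
    vocab_size = len(counts[0])
    phi = []
    for row in counts:
        total = sum(row) + vocab_size * eta
        phi.append([(c + eta) / total for c in row])
    return phi


# Rows are topics, columns are words (toy data, 2 topics x 3 words).
global_counts = [[2, 0, 1], [0, 3, 0]]
# Slave A moved one token of word 1 from topic 1 to topic 0.
slave_a = [[2, 1, 1], [0, 2, 0]]
# Slave B moved one token of word 2 from topic 0 to topic 1.
slave_b = [[2, 0, 0], [0, 3, 1]]

merged = merge_counts(global_counts, [slave_a, slave_b])
phi = topic_word_distribution(merged, eta=0.1)
print(merged)  # [[2, 1, 0], [0, 2, 1]]
```

Each row of phi sums to 1, so it can be read as a probability distribution over words for one topic. A document-topic distribution could be estimated the same way from document-topic counts.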

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.