Patent · US Active

Adaptive construction of a statistical language model

US8577670B2 · kind B2 · utility

5Cited by
5References
17Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJan 8, 2010
Grant dateNov 5, 2013
Priority date
Expiry dateAug 5, 2032

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L15/183
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A statistical language model (SLM) may be iteratively refined by considering N-gram counts in new data, and blending the information contained in the new data with the existing SLM. A first group of documents is evaluated to determine the probabilities associated with the different N-grams observed in the documents. An SLM is constructed based on these probabilities. A second group of documents is then evaluated to determine the probabilities associated with each N-gram in that second group. The existing SLM is then evaluated to determine how well it explains the probabilities in the second group of documents, and a weighting parameter is calculated from that evaluation. Using the weighting parameter, a new SLM is then constructed as a weighted average of the existing SLM and the new probabilities.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.