Adaptive construction of a statistical language model
US8577670B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jan 8, 2010 |
| Grant date | Nov 5, 2013 |
| Priority date | — |
| Expiry date | Aug 5, 2032 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L15/183
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A statistical language model (SLM) may be iteratively refined by considering N-gram counts in new data, and blending the information contained in the new data with the existing SLM. A first group of documents is evaluated to determine the probabilities associated with the different N-grams observed in the documents. An SLM is constructed based on these probabilities. A second group of documents is then evaluated to determine the probabilities associated with each N-gram in that second group. The existing SLM is then evaluated to determine how well it explains the probabilities in the second group of documents, and a weighting parameter is calculated from that evaluation. Using the weighting parameter, a new SLM is then constructed as a weighted average of the existing SLM and the new probabilities.
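The blending step described in the abstract can be illustrated with a minimal Python sketch. Several things here are assumptions made for illustration and are not taken from the patent: the function names (`ngram_probs`, `coverage_weight`, `blend`), the use of whitespace tokenization, the use of joint relative-frequency N-gram probabilities rather than conditional ones, and above all the weighting rule. The abstract says only that the weighting parameter is calculated from how well the existing SLM explains the second group of documents; the sketch substitutes a simple coverage heuristic (the fraction of new N-gram occurrences that the existing model already assigns nonzero probability) as a stand-in.

```python
from collections import Counter


def ngram_probs(docs, n=2):
    """Return (probabilities, counts) for the N-grams in a group of documents.

    Probabilities are plain relative frequencies over all N-gram occurrences
    (an illustrative simplification, not the patent's estimator).
    """
    counts = Counter()
    for doc in docs:
        tokens = doc.split()  # assumption: whitespace tokenization
        for i in range(len(tokens) - n + 1):
            counts[tuple(tokens[i:i + n])] += 1
    total = sum(counts.values())
    probs = {gram: c / total for gram, c in counts.items()} if total else {}
    return probs, counts


def coverage_weight(old_probs, new_counts):
    """Hypothetical weighting rule: share of new N-gram occurrences that the
    existing model already covers. Higher means the old model explains the
    new data better, so it receives more weight in the blend."""
    total = sum(new_counts.values())
    if total == 0:
        return 1.0  # no new evidence: keep the existing model unchanged
    covered = sum(c for gram, c in new_counts.items() if gram in old_probs)
    return covered / total


def blend(old_probs, new_probs, weight):
    """Weighted average of the existing SLM and the new probabilities."""
    grams = set(old_probs) | set(new_probs)
    return {
        g: weight * old_probs.get(g, 0.0) + (1.0 - weight) * new_probs.get(g, 0.0)
        for g in grams
    }


# Usage: build an SLM from a first group, then refine it with a second group.
first_group = ["a b a b"]
second_group = ["a b c"]

slm, _ = ngram_probs(first_group)
new_probs, new_counts = ngram_probs(second_group)
w = coverage_weight(slm, new_counts)
slm = blend(slm, new_probs, w)
```

Because both inputs are probability distributions and the weights sum to one, the blended model is again a distribution. The same three steps can be repeated for each further group of documents, which is the iterative refinement the abstract describes.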
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.