Patent · US Expired

Method and apparatus for distribution-based language model adaptation

US7043422B2 · kind B2 · utility

204Cited by
0References
10Claims
0Family size

Assignee

Inventors

Key dates

Filing dateSep 4, 2001
Grant dateMay 9, 2006
Priority date
Expiry dateJul 25, 2024

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L15/1815
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method and apparatus are provided for adapting a language model to a task-specific domain. Under the method and apparatus, the relative frequency of n-grams in a small training set (i.e. task-specific training data set) and the relative frequency of n-grams in a large training set (i.e. out-of-domain training data set) are used to weight a distribution count of n-grams in the large training set. The weighted distributions are then used to form a modified language model by identifying probabilities for n-grams from the weighted distributions.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.