Patent · US Active

Token-level interpolation for class-based language models

US9734826B2 · kind B2 · utility

Cited by	7
References	8
Claims	20
Family size	0

Assignee

MICROSOFT TECHNOLOGY LICENSING, LLC · US

Inventors

Michael Levit · San Jose, US
Sarangarajan Parthasarathy · Mountain View, US
Andreas Stolcke · Berkeley, US
Shuangyu Chang · Fremont, US

Key dates

Filing date	Mar 11, 2015
Grant date	Aug 15, 2017
Priority date	—
Expiry date	Mar 11, 2035

Classification

Technology area (CPC G)	Physics
CPC primary	G10L15/1815
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

Optimized language models are provided for in-domain applications through an iterative, joint-modeling approach that interpolates a language model (LM) from a number of component LMs according to interpolation weights optimized for a target domain. The component LMs may include class-based LMs, and the interpolation may be context-specific or context-independent. Through iterative processes, the component LMs may be interpolated and used to express training material as alternative representations or parses of tokens. Posterior probabilities may be determined for these parses and used for determining new (or updated) interpolation weights for the LM components, such that a combination or interpolation of component LMs is further optimized for the domain. The component LMs may be merged, according to the optimized weights, into a single, combined LM, for deployment in an application scenario.
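
The weight re-estimation loop described in the abstract can be illustrated with a minimal sketch. This is not the patented method itself: it is a generic expectation-maximization update for context-independent interpolation weights, under the assumption that each training sentence has already been expanded into alternative parses and that every token in a parse carries one probability per component LM. The names `em_step` and `estimate_weights` and the nested-list input layout are hypothetical, chosen only for illustration.

```python
from math import prod


def em_step(sentences, weights):
    """One EM update of context-independent interpolation weights.

    sentences: list of sentences; each sentence is a list of alternative
    parses; each parse is a list of tokens; each token is a list holding
    one probability per component LM (hypothetical input layout).
    """
    counts = [0.0] * len(weights)
    for parses in sentences:
        # Probability of each token in each parse under the current mixture.
        mixed = [[sum(w * p for w, p in zip(weights, tok)) for tok in parse]
                 for parse in parses]
        # Joint probability of each parse, used to form parse posteriors.
        joint = [prod(toks) for toks in mixed]
        total = sum(joint)
        if total == 0.0:
            continue  # no parse is explained by any component; skip
        for parse, toks, j in zip(parses, mixed, joint):
            if j == 0.0:
                continue  # zero-posterior parse contributes nothing
            posterior = j / total
            for tok, mix in zip(parse, toks):
                # Share of this token credited to each component LM,
                # scaled by the posterior of the parse it belongs to.
                for m, (w, p) in enumerate(zip(weights, tok)):
                    counts[m] += posterior * w * p / mix
    norm = sum(counts)
    if norm == 0.0:
        return list(weights)  # nothing to learn from; keep old weights
    return [c / norm for c in counts]


def estimate_weights(sentences, num_components, iterations=20):
    """Iterate EM from uniform weights (assumed starting point)."""
    weights = [1.0 / num_components] * num_components
    for _ in range(iterations):
        weights = em_step(sentences, weights)
    return weights
```

Calling `estimate_weights(sentences, num_components=2)` on in-domain data returns weights that sum to one and favor whichever component assigns higher probability to the training tokens. Context-specific weights, class-based tokens, and the final merge into a single LM are left out of this sketch.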

Source: USPTO / EPO open patent data (bibliographic data and citation counts).