Estimation of parameters for machine translation without in-domain parallel data
US9652453B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Apr 14, 2014 |
| Grant date | May 16, 2017 |
| Priority date | — |
| Expiry date | Jan 25, 2035 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/3344
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A system and method for estimating parameters for features of a translation scoring function for scoring candidate translations in a target domain are provided. Given a source language corpus for a target domain, a similarity measure is computed between the source corpus and a target domain multi-model, which may be a phrase table derived from phrase tables of comparative domains, weighted as a function of similarity with the source corpus. The parameters of the log-linear function for these comparative domains are known. A mapping function is learned between similarity measure and parameters of the scoring function for the comparative domains. Given the mapping function and the target corpus similarity measure, the parameters of the translation scoring function for the target domain are estimated. For parameters where a mapping function with a threshold correlation is not found, another method for obtaining the target domain parameter can be used.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.