Patent · US Active

Apparatus and methods for aligning words in bilingual sentences

US7672830B2 · kind B2 · utility

Cited by	20
References	7
Claims	24
Family size	0

Assignee

Xerox Corporation · US

Inventors

Cyril Goutte · Toronto, CA
Michel Adam Simard · Berwyn, US
Kenji Yamada · Yamanashi, JP
Eric Gaussier · Grenoble, FR
Arne Mauser · Mountain View, US

Key dates

Filing date	May 26, 2005
Grant date	Mar 2, 2010
Priority date	—
Expiry date	Aug 24, 2027

Classification

Technology area (CPC G)	Physics
CPC primary	G06F40/45
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

Methods are disclosed for performing proper word alignment that satisfy constraints of coverage and transitive closure. Initially, a translation matrix which defines word association measures between source and target words of a corpus of bilingual translations of source and target sentences is computed. Subsequently, in a first method, the association measures in the translation matrix are factorized and orthogonalized to produce cepts for the source and target words, which resulting matrix factors may then be, optionally, multiplied to produce an alignment matrix. In a second method, the association measures in the translation matrix are thresholded, and then closed by transitivity, to produce an alignment matrix, which may then be, optionally, factorized to produce cepts. The resulting cepts or alignment matrices may then be used by any number of natural language applications for identifying words that are properly aligned.
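
The second method in the abstract (threshold, then close by transitivity, then optionally read off cepts) can be illustrated with a minimal Python sketch. This is not the patented procedure: the function names, the fixed threshold `tau`, the toy association values, and the reading of a cept as a group of mutually linked source and target words are all assumptions made for illustration, and the sketch does not enforce the coverage constraint.

```python
def threshold(assoc, tau):
    """Binary alignment: 1 where the association measure is at least tau (tau is an assumed cutoff)."""
    return [[1 if v >= tau else 0 for v in row] for row in assoc]


def close_by_transitivity(align):
    """Add link (i, j) whenever i links to some l and some k links to both l and j; repeat to a fixpoint."""
    n_src, n_tgt = len(align), len(align[0])
    a = [row[:] for row in align]  # work on a copy
    changed = True
    while changed:
        changed = False
        for i in range(n_src):
            for j in range(n_tgt):
                if a[i][j]:
                    continue
                if any(a[i][l] and a[k][l] and a[k][j]
                       for k in range(n_src) for l in range(n_tgt)):
                    a[i][j] = 1
                    changed = True
    return a


def cepts(closed):
    """After closure, source words in one group share the same row; group them with their targets."""
    groups = {}
    for i, row in enumerate(closed):
        targets = tuple(j for j, v in enumerate(row) if v)
        if targets:  # unaligned source words are skipped (coverage is not enforced here)
            groups.setdefault(targets, []).append(i)
    return [(srcs, list(tgts)) for tgts, srcs in groups.items()]


# Toy association measures: rows are source words, columns are target words (made-up values).
assoc = [
    [0.9, 0.1, 0.0],
    [0.0, 0.8, 0.7],
    [0.0, 0.1, 0.6],
]
closed = close_by_transitivity(threshold(assoc, 0.5))
print(closed)         # alignment matrix after closure
print(cepts(closed))  # (source indices, target indices) per group
```

In this toy case, source word 2 is linked only to target word 2 after thresholding, but because source word 1 is linked to both target words 1 and 2, closure also links source word 2 to target word 1, so the two source words and two target words end up in a single group.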

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.