Patent · US Active

Adaptive pattern learning for bilingual data mining

US8275604B2 · kind B2 · utility

17 Cited by
2 References
20 Claims
0 Family size

Assignee

Inventors

Key dates

Filing date: Mar 18, 2009
Grant date: Sep 25, 2012
Priority date
Expiry date: Jul 25, 2031

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F40/45
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Embodiments for the adaptive learning of translation layout patterns to mine bilingual data are disclosed. In accordance with at least one embodiment, the adaptive learning of patterns to mine bilingual data includes processing a bilingual web page into a Document Object Model (DOM) tree. The embodiment further includes linking the bilingual snippet pairs of each node into a plurality of bilingual snippet pairs. The embodiment also includes determining one or more best fit candidate patterns based on the plurality of translation snippets via a Support Vector Machine classifier. The embodiment additionally includes mining one or more translation pairs from the bilingual web page using the one or more best fit candidate patterns. The translation pairs are further stored in a data storage. The one or more translation pairs include at least one of a term pair, a phrase pair, or a sentence pair.
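
The pipeline in the abstract (parse the page, collect snippets, choose a best fit layout pattern, mine pairs with it) can be illustrated with a rough Python sketch. This is not the patented method: the Support Vector Machine classifier is replaced here by a plain match count, the DOM tree is reduced to a flat list of text nodes, and the names `TextNodeCollector`, `CANDIDATE_PATTERNS`, and `mine_pairs`, along with the two English/Chinese patterns, are hypothetical and chosen only for illustration.

```python
import re
from html.parser import HTMLParser


class TextNodeCollector(HTMLParser):
    """Collect non-empty text nodes; a flat stand-in for walking a DOM tree."""

    def __init__(self):
        super().__init__()
        self.texts = []

    def handle_data(self, data):
        text = data.strip()
        if text:
            self.texts.append(text)


# Hypothetical candidate layout patterns (assumption, not from the patent).
EN = r"(?P<en>[A-Za-z][A-Za-z ]*[A-Za-z])"
ZH = r"(?P<zh>[\u4e00-\u9fff]+)"
CANDIDATE_PATTERNS = {
    "en (zh)": re.compile(EN + r"\s*\(" + ZH + r"\)"),
    "zh (en)": re.compile(ZH + r"\s*\(" + EN + r"\)"),
}


def mine_pairs(html):
    """Return the best-matching pattern name and the (en, zh) pairs it yields."""
    collector = TextNodeCollector()
    collector.feed(html)
    collector.close()

    # Apply every candidate pattern to every text node.
    matches = {
        name: [m for text in collector.texts for m in pattern.finditer(text)]
        for name, pattern in CANDIDATE_PATTERNS.items()
    }

    # Assumption: pick the pattern with the most matches. The patent
    # describes an SVM classifier for this step instead.
    best = max(matches, key=lambda name: len(matches[name]))
    return best, [(m.group("en"), m.group("zh")) for m in matches[best]]


page = (
    "<ul>"
    "<li>machine translation (机器翻译)</li>"
    "<li>data mining (数据挖掘)</li>"
    "<li>搜索 (search)</li>"
    "</ul>"
)
best_pattern, pairs = mine_pairs(page)
print(best_pattern, pairs)
```

On this sample page the "en (zh)" layout matches two list items and the "zh (en)" layout matches one, so only the pairs from the dominant layout are returned. A real system would presumably keep more than one pattern when several fit well, as the abstract's "one or more best fit candidate patterns" suggests.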

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.