Patent · US Expired

Method and system for extracting pairs of multilingual terminology from an aligned multilingual text

US6236958A · kind A · utility

98Cited by
11References
15Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMay 15, 1998
Grant dateMay 22, 2001
Priority date
Expiry dateMay 15, 2018

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F40/45
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A terminology extraction system which allows for automatic creation of bilingual terminology has a source text which comprises at least one sequence of source terms, aligned with a target text which also comprises at least one sequence of target terms. A term extractor builds a network from each source and target sequence wherein each node of the network comprises at least one term and such that each combination of source terms is included within one source node and each combination of target terms is included within one target node. The term extractor links each source node with each target node, and through a flow optimization method selects relevant links in the resulting network. Once the term extractor has been run on the entire set of aligned sequences, a term statistics circuit computes an association score for each pair of linked source/target terms, and finally the scored pairs of linked source/target term that are considered relevant bilingual terms are stored in a bilingual terminology database. The whole process can be iterated in order to improve the strength of the bilingual links.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.