Patent · US Expired

Method and apparatus for mapping multiword expressions to identifiers using finite-state networks

US7552051B2 · kind B2 · utility

13Cited by
13References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 13, 2002
Grant dateJun 23, 2009
Priority date
Expiry dateJun 15, 2024

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F40/289
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Multiword expressions are mapped to identifiers using finite-state networks. Each of a plurality of multiword expressions is encoded into a regular expression. Each regular expression encodes a base form common to a plurality of derivative forms defined by ones of the multiword expressions. Each of the plurality of regular expressions is compiled with factorization into a set of finite-state networks. A union of the finite-state networks in the set of finite-state networks is performed to define a multiword finite-state network and a set of subnets. The multiword finite-state network and the set of subnets are traversed to identify a path corresponding to one of the plurality of multiword expressions, wherein only transitions originating from the multiword finite-state network are accounted for to ascertain a path number identifying a base form of the one of the plurality of multiword expressions.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.