Patent · US Active

Identifying language of origin for words using estimates of normalized appearance frequency

US7689408B2 · kind B2 · utility

278Cited by
10References
11Claims
0Family size

Assignee

Inventors

Key dates

Filing dateSep 1, 2006
Grant dateMar 30, 2010
Priority date
Expiry dateJan 22, 2029

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F40/263
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

The language of origin of a word or named entity is predicted using estimates of frequency of occurrence of the word or named entity in different languages. In one embodiment, the normalized frequency of occurrence of the word or named entity in a variety of different languages is estimated and the values are used as features in a feature vector which is scored and used to identify language of origin.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.