Patent · US Active

Efficiently representing word sense probabilities

US8280721B2 · kind B2 · utility

4Cited by
53References
10Claims
0Family size

Assignee

Inventors

Key dates

Filing dateAug 29, 2008
Grant dateOct 2, 2012
Priority date
Expiry dateOct 11, 2030

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F40/268
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Word sense probabilities are compressed for storage in a semantic index. Each word sense for a word is mapped to one of a number of “buckets” by assigning a bucket score to the word sense. A scoring function is utilized to assign the bucket scores that maximizes the entropy of the assigned bucket scores. Once the bucket scores have been assigned to the word senses, the bucket scores are stored in the semantic index. The bucket scores stored in the semantic index may be utilized to prune one or more of the word senses prior to construction of the semantic index. The bucket scores may also be utilized to prune and rank the word senses at the time a query is performed using the semantic index.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.