Patent · US Active

System and method of creating and using compact linguistic data

US7809553B2 · kind B2 · utility

17Cited by
26References
14Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJul 17, 2007
Grant dateOct 5, 2010
Priority date
Expiry dateAug 27, 2027

Classification

  • Technology area (CPC Y)Emerging Cross-Sectional Technologies
  • CPC primaryY10S707/99937
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A system and method of creating and using compact linguistic data are provided. Frequencies of words appearing in a corpus are calculated. Each unique character in the words is mapped to a character index, and characters in the words are replaced with the character indexes. Sequences of characters are mapped to substitution indexes, and the sequences of characters in the words are replaced with the substitution indexes. The words are grouped by common prefixes, and each prefix is mapped to location information for the group of words which start with the prefix.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.