Patent · US Expired

System and method of creating and using compact linguistic data

US7269548B2 · kind B2 · utility

47Cited by
5References
17Claims
0Family size

Inventors

Key dates

Filing dateNov 7, 2002
Grant dateSep 11, 2007
Priority date
Expiry dateJun 23, 2025

Classification

  • Technology area (CPC Y)Emerging Cross-Sectional Technologies
  • CPC primaryY10S707/99937
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A system and method of creating and using compact linguistic data are provided. Frequencies of words appearing in a corpus are calculated. Each unique character in the words is mapped to a character index, and characters in the words are replaced with the character indexes. Sequences of characters are mapped to substitution indexes, and the sequences of characters in the words are replaced with the substitution indexes. The words are grouped by common prefixes, and each prefix is mapped to location information for the group of words which start with the prefix.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.