Patent · US Expired

Grouping words with equivalent substrings by automatic clustering based on suffix relationships

US6308149A · kind A · utility

260Cited by
17References
19Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 16, 1998
Grant dateOct 23, 2001
Priority date
Expiry dateDec 16, 2018

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/3344
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A set of words of a natural language is grouped by automatically obtaining suffix relation data that indicate a relation value for each of a set of relationships between suffixes that occur in the natural language, and, then, by automatically clustering the words in the set using the relation values from the suffix relation data, to obtain group data indicating groups of words. Two or more words in a group have suffixes as in one of the relationships and, preceding the suffixes, equivalent substrings. The relationships can be pairwise relationships, and the relation value can indicate the number of occurrences of a suffix pair. The suffix relation data can be obtained using an inflectional lexicon. Complete link clustering can be used.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.