Patent · US Active

Suffix tree similarity measure for document clustering

US8676815B2 · kind B2 · utility

29Cited by
3References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMay 6, 2009
Grant dateMar 18, 2014
Priority date
Expiry dateOct 20, 2029

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/93
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

The subject innovation provides for systems and methods to facilitate weighted suffix tree clustering. Conventional suffix tree cluster models can be augmented by incorporating quality measures to facilitate improved performance. Further the quality measure can be employed in determining cluster labels that show improvements in accuracy over conventional means. Additionally “stopnodes” can be defined to facilitate traversing suffix tree models efficiently. Quality measurements can be determined based in part on weighting factors applied to terms in a vector model, said terms being mapped from a suffix tree model.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.