Patent · US Active

Suffix tree similarity measure for document clustering

US10565233B2 · kind B2 · utility

6Cited by
4References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMar 17, 2014
Grant dateFeb 18, 2020
Priority date
Expiry dateFeb 1, 2035

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/93
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

The subject innovation provides for systems and methods to facilitate weighted suffix tree clustering. Conventional suffix tree cluster models can be augmented by incorporating quality measures to facilitate improved performance. Further the quality measure can be employed in determining cluster labels that show improvements in accuracy over conventional means. Additionally “stopnodes” can be defined to facilitate traversing suffix tree models efficiently. Quality measurements can be determined based in part on weighting factors applied to terms in a vector model, said terms being mapped from a suffix tree model.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.