Suffix tree similarity measure for document clustering
US8676815B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | May 6, 2009 |
| Grant date | Mar 18, 2014 |
| Priority date | — |
| Expiry date | Oct 20, 2029 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/93
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
The subject innovation provides for systems and methods to facilitate weighted suffix tree clustering. Conventional suffix tree cluster models can be augmented by incorporating quality measures to facilitate improved performance. Further the quality measure can be employed in determining cluster labels that show improvements in accuracy over conventional means. Additionally “stopnodes” can be defined to facilitate traversing suffix tree models efficiently. Quality measurements can be determined based in part on weighting factors applied to terms in a vector model, said terms being mapped from a suffix tree model.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.