Patent · US Expired

Clustering based text classification

US7366705B2 · kind B2 · utility

89Cited by
32References
35Claims
0Family size

Assignee

Inventors

Key dates

Filing dateAug 16, 2004
Grant dateApr 29, 2008
Priority date
Expiry dateNov 20, 2025

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/355
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Systems and methods for clustering-based text classification are described. In one aspect text is clustered as a function of labeled data to generate cluster(s). The text includes the labeled data and unlabeled data. Expanded labeled data is then generated as a function of the cluster(s). The expanded label data includes the labeled data and at least a portion of unlabeled data. Discriminative classifier(s) are then trained based on the expanded labeled data and remaining ones of the unlabeled data.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.