Patent · US Active

Method for determining output data for a plurality of text documents

US11263251B2 · kind B2 · utility

Cited by: 0
References: 0
Claims: 12
Family size: 0

Assignee

Inventor

Key dates

Filing date: Apr 16, 2019
Grant date: Mar 1, 2022
Priority date
Expiry date: Oct 14, 2039

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F40/30
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Provided is a method for determining output data for a plurality of text documents, including the steps of: providing a feature matrix as input data, wherein the feature matrix includes information about frequencies of a plurality of features within the plurality of text documents; clustering the feature matrix using a clustering algorithm into at least one clustering matrix, wherein the at least one clustering matrix includes information about the cluster membership of each document of the plurality of documents or each feature of the plurality of features; assigning at least one score to each feature of the plurality of features based on the at least one clustering matrix; ranking the plurality of features based on their assigned scores; and outputting the ranked features as output data. A corresponding computer program product and system are also provided.
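
The steps named in the abstract (feature matrix, clustering, scoring, ranking, output) can be illustrated with a minimal sketch. The abstract does not name a clustering algorithm or a scoring formula, so everything specific here is an assumption made for illustration: a tiny deterministic k-means over documents, a per-document label list standing in for the "clustering matrix", and a score equal to the spread of a feature's mean frequency across clusters. The function names `kmeans_labels` and `rank_features` and the toy data are hypothetical and do not come from the patent.

```python
from typing import List, Tuple


def sq_dist(a: List[float], b: List[float]) -> float:
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans_labels(rows: List[List[float]], k: int, iters: int = 20) -> List[int]:
    """Assumed clustering step: plain k-means seeded with the first k rows.

    Returns one cluster index per document (a simplified stand-in for the
    clustering matrix mentioned in the abstract).
    """
    centroids = [list(rows[i]) for i in range(k)]
    labels = [0] * len(rows)
    for _ in range(iters):
        # Assign each document to its nearest centroid.
        labels = [
            min(range(k), key=lambda c: sq_dist(row, centroids[c]))
            for row in rows
        ]
        # Move each centroid to the mean of its member documents.
        for c in range(k):
            members = [row for row, lab in zip(rows, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def rank_features(
    matrix: List[List[float]], names: List[str], k: int = 2
) -> List[Tuple[str, float]]:
    """Score each feature from the cluster memberships, then rank.

    Assumed score: max minus min of the feature's mean frequency per cluster,
    so features that separate clusters rank above uniformly common ones.
    """
    labels = kmeans_labels(matrix, k)
    scored = []
    for j, name in enumerate(names):
        means = []
        for c in range(k):
            vals = [row[j] for row, lab in zip(matrix, labels) if lab == c]
            if vals:
                means.append(sum(vals) / len(vals))
        scored.append((name, max(means) - min(means)))
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Toy feature matrix: rows are documents, columns are feature frequencies.
features = ["engine", "recipe", "the"]
doc_term = [
    [5, 0, 3],
    [0, 4, 3],
    [4, 0, 3],
    [0, 6, 3],
]
ranked = rank_features(doc_term, features, k=2)
for name, score in ranked:
    print(f"{name}\t{score:.2f}")
```

In this toy run the word "the" occurs equally often in every document, so its assumed score is zero and it ranks last, while the two topic-specific words rank first. A different clustering algorithm or score definition would fit the abstract's wording equally well.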

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.