Patent · US Active

System and method of automatic topic detection in text

US12001797B2 · kind B2 · utility

Cited by: 1
References: 1
Claims: 15
Family size: 0

Inventors

Key dates

Filing date: May 12, 2021
Grant date: Jun 4, 2024
Priority date
Expiry date: May 26, 2042

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06N3/04
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A method and system for automatic topic detection in text may include receiving a text document of a corpus of documents and extracting one or more phrases from the document based on one or more syntactic patterns. For each phrase, embodiments of the invention may: apply a word embedding neural network to one or more words of the phrase to obtain one or more respective word embedding vectors; calculate a weighted phrase embedding vector; and compute a phrase saliency score based on the weighted phrase embedding vector. Embodiments of the invention may subsequently produce one or more topic labels, representing one or more respective topics in the document, based on the computed phrase saliency scores, and may select one or more topic labels according to their relevance to the business domain of the corpus.
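
The steps named in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: the abstract does not say which syntactic patterns, word weights, or saliency formula are used, so every concrete choice here is an assumption made for illustration. The input is assumed to be already part-of-speech tagged, the pattern is "a run of adjectives and nouns ending in a noun", `TOY_EMBEDDINGS` is a hypothetical lookup table standing in for a trained word embedding network, `WORD_WEIGHTS` is a hypothetical weighting, and saliency is taken as cosine similarity to the mean phrase vector of the document. The final selection by business-domain relevance is omitted.

```python
import math

# Hypothetical 3-d vectors standing in for a trained word embedding network.
TOY_EMBEDDINGS = {
    "billing": [0.9, 0.1, 0.0],
    "error": [0.7, 0.3, 0.1],
    "refund": [0.8, 0.2, 0.0],
    "request": [0.5, 0.5, 0.1],
    "nice": [0.0, 0.1, 0.9],
    "day": [0.1, 0.0, 0.8],
}

# Hypothetical per-word weights (for example IDF-like); unlisted words get 1.0.
WORD_WEIGHTS = {"billing": 2.0, "refund": 2.0}


def extract_phrases(tagged):
    """Assumed pattern: maximal runs of ADJ/NOUN tokens that end in a NOUN."""
    phrases, run = [], []
    for word, tag in list(tagged) + [("", "END")]:  # sentinel flushes the last run
        if tag in ("ADJ", "NOUN"):
            run.append((word, tag))
            continue
        while run and run[-1][1] != "NOUN":  # drop trailing adjectives
            run.pop()
        if run:
            phrases.append([w for w, _ in run])
        run = []
    return phrases


def phrase_vector(words):
    """Weighted average of the word vectors; None if no word is known."""
    total, acc = 0.0, [0.0, 0.0, 0.0]
    for w in words:
        vec = TOY_EMBEDDINGS.get(w)
        if vec is None:
            continue
        weight = WORD_WEIGHTS.get(w, 1.0)
        total += weight
        acc = [a + weight * v for a, v in zip(acc, vec)]
    return [a / total for a in acc] if total else None


def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def detect_topics(tagged, top_k=2):
    """Return the top_k phrases ranked by the assumed saliency score."""
    vectors = {}
    for words in extract_phrases(tagged):
        vec = phrase_vector(words)
        if vec is not None:
            vectors[" ".join(words)] = vec
    if not vectors:
        return []
    # Assumed saliency: cosine similarity to the document's mean phrase vector.
    centroid = [sum(col) / len(vectors) for col in zip(*vectors.values())]
    saliency = {label: cosine(vec, centroid) for label, vec in vectors.items()}
    return sorted(saliency, key=saliency.get, reverse=True)[:top_k]


if __name__ == "__main__":
    doc = [("billing", "NOUN"), ("error", "NOUN"), ("caused", "VERB"),
           ("refund", "NOUN"), ("request", "NOUN"), ("on", "ADP"),
           ("nice", "ADJ"), ("day", "NOUN")]
    print(detect_topics(doc))
```

In this toy document the two phrases about billing and refunds lie close to the mean phrase vector and are kept as topic labels, while "nice day" scores lowest and is dropped. A real system would replace the lookup table with a trained embedding model and add the domain-relevance filter described in the abstract.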

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.