Patent · US Active

Computer systems for classifying multilingual text

US11087098B2 · kind B2 · utility

0Cited by
13References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateSep 18, 2018
Grant dateAug 10, 2021
Priority date
Expiry dateMay 17, 2039

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06N20/20
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Various examples are directed to systems and methods for identifying textual information regarding a first topic. A computer system may access a plurality of text units and detect that a first text unit of the plurality of text units is in a first language. The computer system may access a first language keyword set for the first topic, where the first language keyword set comprises a first plurality of keywords associated with the first language. The computer system may determine a first relevance score for the first text unit based at least in part on the first language keyword set. If the first relevance score is greater than a relevance score threshold, the computer system translates the first text unit to a base language and determines a text unit classification for the first text unit using a classification model trained with training data in the base language.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.