Patent · US Active

Methods, systems, and computer-readable media for generating labelled datasets

US12367230B2 · kind B2 · utility

Cited by: 0
References: 0
Claims: 20
Family size: 0

Inventors

Key dates

Filing date: Sep 8, 2023
Grant date: Jul 22, 2025
Priority date
Expiry date: Sep 8, 2043

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F40/279
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A method comprises determining token(s) from a plurality of financial documents and determining, for each token, a set of preliminary attribute labels, wherein the set is associated with attribute type(s). The method further comprises: providing the set for each token to an attribute prediction model to determine, for the token, a confidence value for each of the attribute type(s); determining subsets of tokens, each subset being associated with a respective document of the plurality of documents; and determining a set of refined labels for each document based on the confidence values, wherein the set of refined labels comprises a value for the attribute type(s).
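
The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The `Token` class, the `predict_confidences` stand-in for the attribute prediction model, its toy scores, and the rule of picking the highest-confidence token per attribute type are all assumptions made for illustration; the abstract states only that refined labels are determined "based on the confidence values".

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class Token:
    doc_id: str                      # document the token was extracted from
    text: str                        # token text
    preliminary_labels: set = field(default_factory=set)  # candidate attribute types


def predict_confidences(token, attribute_types):
    """Hypothetical stand-in for the attribute prediction model.

    Returns one confidence value per attribute type. The toy rule here
    (0.9 if the type is among the preliminary labels, else 0.1) is an
    assumption, not the model described in the patent.
    """
    return {
        attr: (0.9 if attr in token.preliminary_labels else 0.1)
        for attr in attribute_types
    }


def refine_labels(tokens, attribute_types, model=predict_confidences):
    # Determine per-document subsets of tokens.
    by_doc = defaultdict(list)
    for tok in tokens:
        by_doc[tok.doc_id].append(tok)

    refined = {}
    for doc_id, doc_tokens in by_doc.items():
        # Score every token in the document against every attribute type.
        scored = [(tok, model(tok, attribute_types)) for tok in doc_tokens]
        labels = {}
        for attr in attribute_types:
            # Assumed selection rule: keep the highest-confidence token's text.
            best_tok, _ = max(scored, key=lambda pair: pair[1][attr])
            labels[attr] = best_tok.text
        refined[doc_id] = labels
    return refined


tokens = [
    Token("inv-1", "ACME Ltd", {"vendor"}),
    Token("inv-1", "120.00", {"amount"}),
    Token("inv-2", "Globex", {"vendor"}),
    Token("inv-2", "75.50", {"amount"}),
]
result = refine_labels(tokens, ["vendor", "amount"])
```

In this sketch, `result` maps each document identifier to one value per attribute type, which corresponds to the "set of refined labels for each document" in the abstract. A real system would replace `predict_confidences` with a trained model and might apply thresholds or other aggregation rules.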

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.