Methods, systems, and computer-readable media for generating labelled datasets
US12367230B2 · kind B2 · utility
Inventors
Key dates
| Filing date | Sep 8, 2023 |
| Grant date | Jul 22, 2025 |
| Priority date | — |
| Expiry date | Sep 8, 2043 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F40/279
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method comprises determining token(s) from financial documents and determining a set of preliminary attribute labels for the token(s), wherein the set is associated with attribute type(s). The method further comprises providing the set for each token to an attribute prediction model to determine, for the token, a confidence value for each attribute type(s), determining subsets of token, each subset being associated with a respective document of the plurality of documents and determining a set of refined labels for each document based on the confidence values, wherein the set of refined labels comprises a value for attribute type(s).
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.