Patent · US Active

Annotating and collecting data-centric AI quality metrics considering user preferences

US12417230B2 · kind B2 · utility

0Cited by
4References
15Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 9, 2022
Grant dateSep 16, 2025
Priority date
Expiry dateDec 9, 2042

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/285
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method, computer program, and computer system are provided for collecting and annotating data based on user preference. Unlabeled data corresponding to one or more entries within a dataset is received. Pseudo-labeled data is generated based on the unlabeled data. Based on one or more quality metrics, each entry from among the pseudo-labeled data is determining to be included within a final dataset. A user is prompted for annotations corresponding to entries of the pseudo-labeled data included within the final dataset. A determination is made as to whether additional data is needed based on comparing the final dataset to the one or more quality metrics, and the additional information is collected if the final dataset does not meet the quality metrics.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.