Patent · US Active

Corpus quality processing for a specified task

US12242797B2 · kind B2 · utility

Cited by: 0
References: 3
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Feb 6, 2023
Grant date: Mar 4, 2025
Priority date
Expiry date: May 19, 2043

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06F40/151
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Processing within a computing environment is facilitated using a corpus processing system to assess and enhance quality of a corpus of unstructured documents for a specified task. The processing includes referencing, by a corpus processing engine, the corpus of unstructured documents to obtain unstructured document data, and applying, by a corpus quality metrics engine, a set of quality metrics to the document data to obtain a set of quality metric scores. Further, the process includes automatically selecting, by a quality metric selection engine, a subset of task-relevant quality metrics using the quality metric scores and the specified task, and automatically transforming, at least in part, multiple documents of the corpus to remediate one or more identified issues with the documents. The automatically transforming results in remediated documents tuned for the specified task, which are provided for the specified task to be performed.
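
The flow described in the abstract (score the corpus, select task-relevant metrics, transform documents) can be illustrated with a minimal sketch. This is not the patented implementation: the metric functions, the TASK_RELEVANCE table, the 0.99 threshold, and the task names are all hypothetical, chosen only to make the four stages concrete.

```python
import re
from typing import Callable, Dict, List, Set

# Hypothetical quality metrics: each maps document text to a score in [0, 1],
# where 1.0 means "no issue detected". The patent does not name these metrics.
def whitespace_cleanliness(text: str) -> float:
    """Share of characters that are not surplus (repeated) whitespace."""
    if not text:
        return 1.0
    extra = sum(len(m.group()) - 1 for m in re.finditer(r"\s{2,}", text))
    return 1.0 - extra / len(text)

def markup_free(text: str) -> float:
    """Share of characters that are not inside angle-bracket tags."""
    if not text:
        return 1.0
    tag_chars = sum(len(m.group()) for m in re.finditer(r"<[^>]+>", text))
    return 1.0 - tag_chars / len(text)

METRICS: Dict[str, Callable[[str], float]] = {
    "whitespace": whitespace_cleanliness,
    "markup": markup_free,
}

# Hypothetical remediation transform for each metric.
REMEDIATIONS: Dict[str, Callable[[str], str]] = {
    "whitespace": lambda t: re.sub(r"\s+", " ", t).strip(),
    "markup": lambda t: re.sub(r"<[^>]+>", "", t),
}

# Assumed mapping from task to the metrics that matter for it.
TASK_RELEVANCE: Dict[str, Set[str]] = {
    "classification": {"whitespace", "markup"},
    "html_parsing": {"whitespace"},  # tags must survive for this task
}

def score_corpus(docs: List[str]) -> Dict[str, float]:
    """Apply every metric to every document and return the mean per metric."""
    if not docs:
        return {name: 1.0 for name in METRICS}
    return {
        name: sum(metric(d) for d in docs) / len(docs)
        for name, metric in METRICS.items()
    }

def select_metrics(scores: Dict[str, float], task: str,
                   threshold: float = 0.99) -> List[str]:
    """Keep metrics that are relevant to the task and score below threshold."""
    relevant = TASK_RELEVANCE.get(task, set())
    return sorted(n for n, s in scores.items() if n in relevant and s < threshold)

def remediate(docs: List[str], selected: List[str]) -> List[str]:
    """Apply the remediation for each selected metric to every document."""
    out = []
    for doc in docs:
        for name in selected:
            doc = REMEDIATIONS[name](doc)
        out.append(doc)
    return out

def tune_corpus(docs: List[str], task: str) -> List[str]:
    """Score the corpus, select task-relevant metrics, then remediate."""
    return remediate(docs, select_metrics(score_corpus(docs), task))
```

Under these assumptions, the same corpus is remediated differently per task: tune_corpus(docs, "classification") strips tags and collapses whitespace, while tune_corpus(docs, "html_parsing") only collapses whitespace and leaves the markup in place.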

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.