Corpus quality processing for a specified task
US12242797B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Feb 6, 2023 |
| Grant date | Mar 4, 2025 |
| Priority date | — |
| Expiry date | May 19, 2043 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F40/151
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Processing within a computing environment is facilitated using a corpus processing system to assess and enhance quality of a corpus of unstructured documents for a specified task. The processing includes referencing, by a corpus processing engine, the corpus of unstructured documents to obtain unstructured document data, and applying, by a corpus quality metrics engine, a set of quality metrics to the document data to obtain a set of quality metric scores. Further, the process includes automatically selecting, by a quality metric selection engine, a subset of task-relevant quality metrics using the quality metric scores and the specified task, and automatically transforming, at least in part, multiple documents of the corpus to remediate one or more identified issues with the documents. The automatically transforming results in remediated documents tuned for the specified task, which are provided for the specified task to be performed.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.