Cognitive document quality determination with automated heuristic generation
US10902044B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Nov 2, 2018 |
| Grant date | Jan 26, 2021 |
| Priority date | — |
| Expiry date | Apr 2, 2039 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06N20/00
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Techniques for cognitive document quality determination and automated heuristic generation are provided. A plurality of documents is received, where each of the plurality of documents contains natural language text. A plurality of values is determined for a first plurality of predefined attributes of the plurality of documents. A plurality of quality scores is generated for the plurality of documents by processing the plurality of values using a machine learning model, where the plurality of quality scores indicate a suitability of each of the plurality of documents to be processed using a target processing operation. A subset of documents is identified from the plurality of documents having respective quality scores below a predefined threshold. The subset of documents is flagged for further processing. At least one document of the plurality of documents that is not flagged is selectively processed using the target processing operation.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.