Machine learning techniques for normalization of unstructured data into structured data
US12032590B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Event | Date |
| --- | --- |
| Filing date | Dec 28, 2022 |
| Grant date | Jul 9, 2024 |
| Priority date | — |
| Expiry date | Dec 28, 2042 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06F18/2413
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Various embodiments of the present disclosure provide methods, apparatus, systems, computing devices, computing entities, and/or the like for classifying unstructured data by: (i) generating probability scores of natural language classification labels for classifying unstructured data elements using an NLP-based model, (ii) generating probability scores of structured data classification labels for classifying the unstructured data elements using a classification-based model, and (iii) assigning classification labels based on: a) the probability scores of the natural language classification labels if a distance measure difference associated with the natural language classification labels is greater than a predetermined distance, or b) a determination using an ensemble model if the distance measure difference is less than the predetermined distance.
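The decision rule in step (iii) can be illustrated with a minimal Python sketch. It is not the patented implementation. Several details are assumptions made only for illustration: the "distance measure difference" is taken to be the gap between the two highest NLP probability scores, both models are assumed to score the same set of labels, the ensemble is a plain average of the two score sets, and the names `top_two_gap`, `average_ensemble`, `assign_label`, and the default threshold of 0.2 are hypothetical.

```python
from typing import Callable, Dict

Scores = Dict[str, float]  # label -> probability score


def top_two_gap(scores: Scores) -> float:
    """Gap between the best and second-best score.

    ASSUMPTION: stands in for the abstract's "distance measure difference",
    which the abstract does not define.
    """
    ranked = sorted(scores.values(), reverse=True)
    if not ranked:
        return 0.0
    if len(ranked) == 1:
        return ranked[0]
    return ranked[0] - ranked[1]


def average_ensemble(nlp_scores: Scores, clf_scores: Scores) -> str:
    """ASSUMPTION: a simple averaging ensemble over a shared label set."""
    labels = sorted(set(nlp_scores) | set(clf_scores))
    combined = {
        label: (nlp_scores.get(label, 0.0) + clf_scores.get(label, 0.0)) / 2
        for label in labels
    }
    return max(labels, key=lambda label: combined[label])


def assign_label(
    nlp_scores: Scores,
    clf_scores: Scores,
    threshold: float = 0.2,  # hypothetical "predetermined distance"
    ensemble: Callable[[Scores, Scores], str] = average_ensemble,
) -> str:
    """Use the NLP label when its scores are well separated, else the ensemble."""
    if top_two_gap(nlp_scores) > threshold:
        # Branch (a): the NLP model is decisive, so its top label is assigned.
        return max(nlp_scores, key=lambda label: nlp_scores[label])
    # Branch (b): scores are too close, so defer to the ensemble.
    # The abstract leaves the equal-to-threshold case open; it lands here.
    return ensemble(nlp_scores, clf_scores)


if __name__ == "__main__":
    decisive = assign_label({"a": 0.9, "b": 0.1}, {"a": 0.2, "b": 0.8})
    ambiguous = assign_label({"a": 0.55, "b": 0.45}, {"a": 0.1, "b": 0.9})
    print(decisive, ambiguous)
```

In the first call the NLP scores are far apart, so the NLP label is kept even though the second model disagrees. In the second call the NLP scores are close, so the averaged scores decide. A real system would presumably map between the natural language labels and the structured data labels rather than assume they coincide.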
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.