System and method for analysis of structured and unstructured data
US10922358B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Oct 12, 2018 |
| Grant date | Feb 16, 2021 |
| Priority date | — |
| Expiry date | Jan 31, 2039 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06N7/01
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
The invention relates to a computer-implemented system and method for analyzing unstructured data from a plurality of input files, and standardizing the data to a format that can be consumed by downstream systems. The method may comprise the steps of: receiving at least one input file to be analyzed, wherein the at least one input file includes the structured and unstructured data; splitting the at least one input file into a plurality of documents; classifying each page of the plurality of documents as one of structured or unstructured data; parsing the pages of the plurality of documents classified as unstructured data; extracting relevant data from the parsed pages; mapping each of the extracted relevant data to standardized output; and generating canonical data sets based on the standardized outputs.
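The sequence of steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the page and document delimiters, the "key: value" classification heuristic, the regular expressions, and the field names (`PAGE_BREAK`, `DOC_BREAK`, `PATTERNS`, `FIELD_MAP`, `analyze`) are all hypothetical choices made only to show how the stages could fit together.

```python
import re

# Assumed delimiters; the abstract does not say how files are split.
PAGE_BREAK = "\f"
DOC_BREAK = "\n=====\n"

# Hypothetical extraction patterns and field mapping, for illustration only.
PATTERNS = {
    "invoice date": re.compile(r"dated (\d{4}-\d{2}-\d{2})"),
    "total due": re.compile(r"total of \$?([\d,]+\.\d{2})"),
}
FIELD_MAP = {"invoice date": "date", "total due": "amount"}


def split_into_documents(raw):
    """Split one input file into documents, each a list of page texts."""
    return [doc.split(PAGE_BREAK) for doc in raw.split(DOC_BREAK)]


def classify_page(text):
    """Toy heuristic: a page is 'structured' if at least half of its
    non-empty lines look like 'key: value'; otherwise 'unstructured'."""
    lines = [ln for ln in text.splitlines() if ln.strip()]
    if not lines:
        return "unstructured"
    kv = sum(1 for ln in lines if re.match(r"^\s*[\w ]+:\s*\S", ln))
    return "structured" if kv / len(lines) >= 0.5 else "unstructured"


def extract(text):
    """Parse an unstructured page and pull out the fields of interest."""
    found = {}
    for name, pattern in PATTERNS.items():
        match = pattern.search(text)
        if match:
            found[name] = match.group(1)
    return found


def standardize(fields):
    """Map extracted fields to standardized names and value types."""
    out = {}
    for name, value in fields.items():
        key = FIELD_MAP.get(name, name)
        out[key] = float(value.replace(",", "")) if key == "amount" else value
    return out


def analyze(raw):
    """Run the full sequence and return one canonical record per document."""
    canonical = []
    for doc_id, pages in enumerate(split_into_documents(raw)):
        record = {"doc_id": doc_id}
        for text in pages:
            if classify_page(text) == "unstructured":
                record.update(standardize(extract(text)))
        canonical.append(record)
    return canonical
```

In this sketch, only pages classified as unstructured are parsed, mirroring the wording of the abstract; how structured pages are handled is left out because the abstract does not describe it.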
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.