Generating, filtering, and combing structured data records using machine learning
US12293843B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Aug 26, 2024 |
| Grant date | May 6, 2025 |
| Priority date | — |
| Expiry date | Aug 26, 2044 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG16H10/20
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for processing a large corpus of unstructured data using an extraction neural network to generate a corresponding collection of structured data records. According to one aspect, there is provided a method that includes obtaining a set of input text sequences, and generating a collection of structured data records from the set of input text sequences using an extraction neural network. Each structured data record defines a structured representation of a corresponding input text sequence with reference to a predefined schema of semantic categories. The collection of structured data records can be filtered to identify and remove structured data records that are predicted to be unreliable. The collection of structured data records can then be processed to generate an output that is directed to a selected topic and that aggregates information from across multiple structured data records.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.