Hierarchical information extraction using document segmentation and optical character recognition correction
US9715625B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jan 27, 2012 |
| Grant date | Jul 25, 2017 |
| Priority date | — |
| Expiry date | Jun 30, 2032 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06V30/10
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Systems, methods, and media for extracting and processing entity data included in an electronic document are provided herein. Methods may include executing one or more extractors to extract entity data within an electronic document based upon an extraction model for the document, selecting extracted entity data via one or more experts, each of the experts applying at least one business rule to organize at least a portion of the selected entity data into a desired format, and providing the organized entity data for use by an end user.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.