System and method for automatically extracting multi-format data from documents and converting into XML
US9158744B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Mar 13, 2013 |
| Grant date | Oct 13, 2015 |
| Priority date | — |
| Expiry date | Jan 25, 2034 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06F40/117
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A system, a computer-implemented method and a computer program product are provided for extracting insurance data from one or more documents having one or more file formats and converting the data into Extensible Markup Language (XML) format. The system comprises a user interface configured to let one or more users submit one or more documents related to insurance. The system further comprises a business type classification module configured to identify the submitted documents by business type, and a format classification module configured to identify the file format of each submitted document. Finally, the system comprises an extraction and conversion module configured to match one or more headers in the submitted documents against one or more pre-stored headers, extract the insurance data corresponding to the matched headers, and convert the extracted insurance data into XML format.
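The header-matching and conversion step described in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation. The header table `PRE_STORED_HEADERS`, the functions `normalize`, `classify_format` and `extract_to_xml`, the XML element names, and the extension-based format check are all hypothetical and chosen only for illustration. The business type is assumed to have been determined already and is passed in as a plain label.

```python
import xml.etree.ElementTree as ET

# Hypothetical pre-stored headers: normalized header text -> XML tag name.
PRE_STORED_HEADERS = {
    "policy number": "PolicyNumber",
    "insured name": "InsuredName",
    "premium": "Premium",
}


def normalize(header: str) -> str:
    """Lower-case a header and collapse runs of whitespace."""
    return " ".join(header.strip().lower().split())


def classify_format(filename: str) -> str:
    """Guess the file format from the extension (an assumed, simplistic rule)."""
    ext = filename.rsplit(".", 1)[-1].lower() if "." in filename else ""
    known = {"csv": "csv", "xls": "excel", "xlsx": "excel", "pdf": "pdf"}
    return known.get(ext, "unknown")


def extract_to_xml(headers, rows, business_type):
    """Keep only columns whose header matches a pre-stored header, emit XML."""
    matched = [
        (index, PRE_STORED_HEADERS[normalize(h)])
        for index, h in enumerate(headers)
        if normalize(h) in PRE_STORED_HEADERS
    ]
    root = ET.Element("InsuranceData", businessType=business_type)
    for row in rows:
        record = ET.SubElement(root, "Record")
        for index, tag in matched:
            ET.SubElement(record, tag).text = row[index]
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    headers = ["Policy Number", " insured  NAME ", "Broker Notes", "Premium"]
    rows = [["P-1", "Acme", "n/a", "100.00"]]
    print(classify_format("claims.XLSX"))
    print(extract_to_xml(headers, rows, "property"))
```

In this sketch the unmatched "Broker Notes" column is simply dropped, and matching is an exact comparison after normalization. The abstract does not say how matching is performed, so a real system might use fuzzy or synonym-based matching instead.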
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.