System and method for detecting equations
US8818033B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Apr 27, 2012 |
| Grant date | Aug 26, 2014 |
| Priority date | — |
| Expiry date | Jan 22, 2033 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06V30/244
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A system and method of extracting formulas in an electronic image of a document using optical character recognition (OCR) is disclosed. In one example, the method comprises analyzing the electronic image, including a plurality of text lines, to generate a plurality of bounding blocks, each bounding block associated with a text line detected within the electronic image, searching the plurality of text lines to detect at least one character matching one of a plurality of character groups, calculating a symbol density of each of the plurality of character groups for each of the plurality of text lines, and classifying each of the plurality of text lines as at least one of an equation block type, an inline equation block type, and a descriptive block type, based on the symbol density, wherein each of the plurality of text lines classified as the equation block type is extracted.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.