Patent · US Active

System and method for detecting equations

US8818033B1 · kind B1 · utility

Cited by: 55
References: 0
Claims: 12
Family size: 0

Assignee

Inventors

Key dates

Filing date: Apr 27, 2012
Grant date: Aug 26, 2014
Priority date
Expiry date: Jan 22, 2033

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06V30/244
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A system and method of extracting formulas in an electronic image of a document using optical character recognition (OCR) is disclosed. In one example, the method comprises analyzing the electronic image, including a plurality of text lines, to generate a plurality of bounding blocks, each bounding block associated with a text line detected within the electronic image, searching the plurality of text lines to detect at least one character matching one of a plurality of character groups, calculating a symbol density of each of the plurality of character groups for each of the plurality of text lines, and classifying each of the plurality of text lines as at least one of an equation block type, an inline equation block type, and a descriptive block type, based on the symbol density, wherein each of the plurality of text lines classified as the equation block type is extracted.
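
The classification step in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The abstract does not enumerate the character groups, the density formula, or the thresholds, so CHARACTER_GROUPS, EQUATION_THRESHOLD, INLINE_THRESHOLD, and the TextLine type below are all hypothetical. The sketch assumes density means the fraction of non-whitespace characters in a line that fall into each group, and it assumes the OCR stage has already produced text lines with bounding blocks.

```python
from dataclasses import dataclass

# Hypothetical character groups; the abstract does not list the real ones.
CHARACTER_GROUPS = {
    "operators": set("+-=*/^<>±×÷"),
    "greek": set("αβγδθλμπσφω"),
    "brackets": set("()[]{}"),
    "digits": set("0123456789"),
}

# Assumed thresholds on the summed density, chosen only for illustration.
EQUATION_THRESHOLD = 0.35
INLINE_THRESHOLD = 0.10


@dataclass
class TextLine:
    """One OCR text line with its bounding block (x, y, width, height)."""
    text: str
    bbox: tuple


def symbol_density(text):
    """Return, per character group, the fraction of non-space characters in it."""
    chars = [c for c in text if not c.isspace()]
    if not chars:
        return {name: 0.0 for name in CHARACTER_GROUPS}
    return {
        name: sum(c in group for c in chars) / len(chars)
        for name, group in CHARACTER_GROUPS.items()
    }


def classify_line(text):
    """Classify a line as 'equation', 'inline_equation', or 'descriptive'."""
    # The groups above are disjoint, so the summed density stays within [0, 1].
    total = sum(symbol_density(text).values())
    if total >= EQUATION_THRESHOLD:
        return "equation"
    if total >= INLINE_THRESHOLD:
        return "inline_equation"
    return "descriptive"


def extract_equations(lines):
    """Keep only the lines classified as the equation block type."""
    return [line for line in lines if classify_line(line.text) == "equation"]


if __name__ == "__main__":
    page = [
        TextLine("This paragraph describes the method.", (10, 10, 400, 20)),
        TextLine("where x = 2 and y = 3 + z", (10, 40, 300, 20)),
        TextLine("E = mc^2 + 4", (120, 70, 160, 24)),
    ]
    for line in page:
        print(classify_line(line.text), "|", line.text)
    print([line.bbox for line in extract_equations(page)])
```

Summing the per-group densities into one score is a simplification. A classifier closer to the abstract's wording could weight or threshold each character group separately.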

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.