Patent · US Active

Document content reconstruction

US9098471B2 · kind B2 · utility

7Cited by
0References
18Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJul 6, 2012
Grant dateAug 4, 2015
Priority date
Expiry dateNov 1, 2033

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F40/10
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method, a storage medium and a system for document content reconstruction are provided in a digital content delivery and online education services platform to enable delivery of textbooks and other copyrighted material to multi-platform web browser applications. The method comprises ingesting a document page in an unstructured document format. The method further comprises extracting one or more images and metadata associated with the images and text and fonts associated with the texts from the document page. In addition, the method comprises coalescing text into paragraphs and creating a structured document page in a markup language format using the extracted images, text and fonts rendered with layout fidelity to the original ingested document page.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.