Patent · US Active

Generating tagged content from text of an electronic document

US12056434B2 · kind B2 · utility

1Cited by
1References
40Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJan 6, 2023
Grant dateAug 6, 2024
Priority date
Expiry dateJan 6, 2043

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06T2207/30176
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Techniques for generating formatting tags for textual content obtained from a source electronic document are disclosed. A system parses a digital file to obtain information about characters in an electronic document. The system applies tags to text generated based on the textual content of the electronic document by creating segments of textually-consecutive characters and applying corresponding text formatting style tags to the segments. The system further identifies segments of text overlapping bounding boxes in the electronic document. The system generates textual content including a segment of text and a corresponding hyperlink associated with the segment of text. The system further generates textual content by selectively applying line breaks from the source electronic document in the textual content.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.