Patent · US Active

Method of determining Unicode values corresponding to the text in digital documents

US7636885B2 · kind B2 · utility

18Cited by
4References
13Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJun 6, 2006
Grant dateDec 22, 2009
Priority date
Expiry dateJul 18, 2028

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F40/126
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method of determining Unicode values corresponding to the text in digital documents includes: providing a digital document containing information related to the text in the document, the information including at least one set of data selected from the group consisting of: the numerical character code comprised by a single byte value or a sequence of multiple bytes, the glyph name corresponding to the character code for simple fonts, the code-to-Unicode mapping provided by a ToUnicode CMap, and font outline data embedded in the document; obtaining the information related to the text from the document; and determining the Unicode values corresponding to a specific code of a specific font on a per-glyph basis by executing a cascade of determination steps for each code separately, the cascade being executed in a predetermined sequence using different sources of information.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.