Patent · US Active

Transliterating semitic languages including diacritics

US8612206B2 · kind B2 · utility

25Cited by
1References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 8, 2009
Grant dateDec 17, 2013
Priority date
Expiry dateApr 9, 2032

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F40/53
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

The present disclosure describes a system and method of transliterating Semitic languages with support for diacritics. An input module receives and pre-processes Romanized character and forwards the pre-processed Romanized characters to a transliteration engine. The transliteration engine selects candidate transliteration rules, applies the rules, and scores and ranks the results for output. To optimize search for candidate transliteration rules, the transliteration engine may apply word-stemming strategies to process inflections indicated by affixes. The present disclosure further describes optimizations as pre-processing emphasis text, caching, dynamic transliteration rule pruning, and buffering/throttling input. The system and methods are suitable for multiple applications including but not limited to web applications, windows applications, client-server applications and input method editors such as those via Microsoft Text Services Framework TSF™.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.