Patent · US Expired

Method and system of creating and using Chinese language data and user-corrected data

US7228267B2 · kind B2 · utility

1Cited by
4References
17Claims
0Family size

Assignee

Inventors

Key dates

Filing dateNov 27, 2002
Grant dateJun 5, 2007
Priority date
Expiry dateJun 15, 2025

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F40/53
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Unique identifiers for each of a plurality of Chinese Pinyin syllables are generated and stored in an array of identifiers. A plurality of Hanzi character candidate lists is also generated, each list including Hanzi character candidates associated with a Pinyin syllable. Each identifier in the array has an array index, and each Hanzi character candidate in each list has a candidate index in the list. For each of a plurality of words having multiple Pinyin syllables, a data record including a key and a value is then generated. In a data record for a word, the key is an array index of the identifier in the array of identifiers and tone information for each of the multiple Pinyin syllables of the word, and the value is a candidate index, in the list of candidates associated with each of the Pinyin syllables, of the candidate that represents each of the Pinyin syllables.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.