Method and system of creating and using Chinese language data and user-corrected data
US7228267B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Nov 27, 2002 |
| Grant date | Jun 5, 2007 |
| Priority date | — |
| Expiry date | Jun 15, 2025 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06F40/53
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Unique identifiers for each of a plurality of Chinese Pinyin syllables are generated and stored in an array of identifiers. A plurality of Hanzi character candidate lists is also generated, each list including Hanzi character candidates associated with a Pinyin syllable. Each identifier in the array has an array index, and each Hanzi character candidate in each list has a candidate index in the list. For each of a plurality of words having multiple Pinyin syllables, a data record including a key and a value is then generated. In a data record for a word, the key is an array index of the identifier in the array of identifiers and tone information for each of the multiple Pinyin syllables of the word, and the value is a candidate index, in the list of candidates associated with each of the Pinyin syllables, of the candidate that represents each of the Pinyin syllables.
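The record layout described in the abstract can be illustrated with a minimal Python sketch. It is an assumption-laden illustration, not the patented implementation: the syllable inventory, the candidate lists, and the names `SYLLABLE_IDS`, `CANDIDATES`, `make_record`, and `decode` are all hypothetical, and the syllable string itself stands in for the "unique identifier". Tones are written as the integers 1 to 4.

```python
from typing import Dict, List, Tuple

# Hypothetical sample data: the abstract does not give the real syllable
# inventory or candidate lists. The position in SYLLABLE_IDS is the array index.
SYLLABLE_IDS: List[str] = ["zhong", "guo", "ren"]

# One Hanzi candidate list per Pinyin syllable; the position in each list
# is the candidate index.
CANDIDATES: Dict[str, List[str]] = {
    "zhong": ["中", "种", "重"],
    "guo": ["国", "过", "果"],
    "ren": ["人", "认", "任"],
}

Key = Tuple[Tuple[int, int], ...]  # one (array index, tone) pair per syllable
Value = Tuple[int, ...]            # one candidate index per syllable


def make_record(syllables: List[Tuple[str, int]], word: str) -> Tuple[Key, Value]:
    """Build a (key, value) record for a multi-syllable word."""
    if len(syllables) != len(word):
        raise ValueError("expected one Hanzi character per Pinyin syllable")
    # Key: array index of each syllable identifier plus its tone.
    key = tuple((SYLLABLE_IDS.index(s), tone) for s, tone in syllables)
    # Value: index of the chosen character within that syllable's candidate list.
    value = tuple(
        CANDIDATES[s].index(ch) for (s, _tone), ch in zip(syllables, word)
    )
    return key, value


def decode(key: Key, value: Value) -> str:
    """Recover the Hanzi word from a record by following both sets of indices."""
    return "".join(
        CANDIDATES[SYLLABLE_IDS[idx]][cand]
        for (idx, _tone), cand in zip(key, value)
    )


record = make_record([("zhong", 1), ("guo", 2)], "中国")
print(record)           # ((0, 1), (1, 2)), (0, 0)
print(decode(*record))  # 中国
```

Because both the key and the value consist only of small integers, a record in this sketch never stores the Pinyin spelling or the Hanzi characters themselves; they are recovered by indexing into the shared identifier array and candidate lists.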
Source: USPTO / EPO open patent data. Objective bibliographic data and citation counts.