Patent · US Expired

Consistency checker for documents containing japanese text

US6175834A · kind A · utility

67Cited by
8References
39Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJun 24, 1998
Grant dateJan 16, 2001
Priority date
Expiry dateJun 24, 2018

Classification

  • Technology area (CPC Y)Emerging Cross-Sectional Technologies
  • CPC primaryY10S707/99943
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A Consistency Checker provides an improved method of analyzing a Japanese text document to identify inconsistently spelled words. The Consistency Checker utilizes a Reading Pair Database (RPD) and a Compressed Lexicon Database (CLD) to determine the reading units within a word, to calculate a Reading Pair Identification Number (RID) for each reading unit, to calculate a Sense Identification Number (SID) for each word, and to calculate a Spelling Variant Identification Number (SVID) for each word. Spelling variants are generated by combining variations of individual RIDs in the RID array. A Registry is updated to maintain statistics on all of the words within the document. An error field within the Registry indicates that the document contains more than one spelling variant of a particular word. The client program can access the Registry to alert a user to inconsistencies discovered in the document. The RPD comprises a list of reading pairs correlating Japanese text reading units of one character set with equivalent Japanese text reading units of another character set. Equivalent reading units from each character set are combined to form a reading pair and each reading pair is assig…

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.