Patent · US Expired

Morphological analysis method and device and Japanese language morphological analysis method and device

US6098035A · kind A · utility

13Cited by
7References
34Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMar 19, 1998
Grant dateAug 1, 2000
Priority date
Expiry dateMar 19, 2018

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F40/268
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

There is provided a morphological analysis method and device whereby, even if unknown words are present, processing can be effected with high accuracy and at high speed and economy of resources can be achieved. Expanded characters e.sub.i are generated by adding to each character c.sub.i of input text, in addition to word division information d.sub.i, expansion information including required arbitrarily selectable information such as tag information, and all possible expanded character sequences are generated. Beforehand, by training, the partial chain probabilities (appearance probabilities) of N-gram (where, normally N=1 or 2 or 3) character sequences are stored in an expanded character table. The partial character sequences of the expanded character sequences are successively extracted from the beginning of the expanded character sequence and the respective partial chain probabilities are found by referring to the expanded character table, and the product of the thus-found partial chain probabilities is obtained. This product is found for all the expanded character sequences, and analysis results etc. consisting of a row of word sequences in order of character sequences correspo…

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.