Morphological analysis method and device and Japanese language morphological analysis method and device
US6098035A · kind A · utility
Assignee
Inventors
Key dates
| Filing date | Mar 19, 1998 |
| Grant date | Aug 1, 2000 |
| Priority date | — |
| Expiry date | Mar 19, 2018 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F40/268
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
There is provided a morphological analysis method and device whereby, even if unknown words are present, processing can be effected with high accuracy and at high speed and economy of resources can be achieved. Expanded characters e.sub.i are generated by adding to each character c.sub.i of input text, in addition to word division information d.sub.i, expansion information including required arbitrarily selectable information such as tag information, and all possible expanded character sequences are generated. Beforehand, by training, the partial chain probabilities (appearance probabilities) of N-gram (where, normally N=1 or 2 or 3) character sequences are stored in an expanded character table. The partial character sequences of the expanded character sequences are successively extracted from the beginning of the expanded character sequence and the respective partial chain probabilities are found by referring to the expanded character table, and the product of the thus-found partial chain probabilities is obtained. This product is found for all the expanded character sequences, and analysis results etc. consisting of a row of word sequences in order of character sequences correspo…
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.