Method of retrieving no word separation text data and a data retrieving apparatus therefor
US6546401B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Jul 17, 2000 |
| Grant date | Apr 8, 2003 |
| Priority date | — |
| Expiry date | Feb 14, 2021 |
Classification
- Technology area (CPC Y)Emerging Cross-Sectional Technologies
- CPC primaryY10S707/99948
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Full text data is divided into words to generate word separation data. All character strings are extracted from the full text data, each character string including N characters. The word separation and position data is attached to each character string to generate index data. In word retrieving, character and segmentation agreement between query data and all character strings is checked. Word retrieving and/or character string retrieving are effected according to a selection command. The word separation data may include leading or trailing end data. In the word retrieving mode, the leading end of the first character and the trailing end of the last character are checked but the intermediate portion is not checked. Continuity of retrieve character strings is checked with reference to position data thereof. The word retrieving mode includes a number of modes including the completion agreement mode. A non-target word in retrieving is detected according to a word class and the word separation data is not attached to the non-target word. The word separation data is not attached to words of the affix. Sets of full text data are retrieved and the matching degrees are detected and the sets…
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.