Patent · US Expired

Method of retrieving no word separation text data and a data retrieving apparatus therefor

US6546401B1 · kind B1 · utility

15Cited by
3References
40Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJul 17, 2000
Grant dateApr 8, 2003
Priority date
Expiry dateFeb 14, 2021

Classification

  • Technology area (CPC Y)Emerging Cross-Sectional Technologies
  • CPC primaryY10S707/99948
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Full text data is divided into words to generate word separation data. All character strings are extracted from the full text data, each character string including N characters. The word separation and position data is attached to each character string to generate index data. In word retrieving, character and segmentation agreement between query data and all character strings is checked. Word retrieving and/or character string retrieving are effected according to a selection command. The word separation data may include leading or trailing end data. In the word retrieving mode, the leading end of the first character and the trailing end of the last character are checked but the intermediate portion is not checked. Continuity of retrieve character strings is checked with reference to position data thereof. The word retrieving mode includes a number of modes including the completion agreement mode. A non-target word in retrieving is detected according to a word class and the word separation data is not attached to the non-target word. The word separation data is not attached to words of the affix. Sets of full text data are retrieved and the matching degrees are detected and the sets…

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.