Patent · US Expired

Segmented document indexing and search

US6631373B1 · kind B1 · utility

61Cited by
8References
67Claims
0Family size

Assignee

Inventors

Key dates

Filing dateFeb 29, 2000
Grant dateOct 7, 2003
Priority date
Expiry dateFeb 29, 2020

Classification

  • Technology area (CPC Y)Emerging Cross-Sectional Technologies
  • CPC primaryY10S707/99945
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

There is provided a text search apparatus capable of dividing a structured document such as an HTML document into segments, and presenting segments containing a given search key as the search result, thereby providing a part of the document matching the search condition as the result of search. The document is divided into segments by specified tags, and a level of association with an adjacent segment is calculated. A header is detected by a header tag, and the header information is added to the segment contained in the range of the header. Segments are divided and re-integrated according to the level of association therebetween, and indexes are prepared. A search is executed for two indexes, and the level of matching is calculated by weighting the search results for the indexes, and the search result judged according to such level of matching is stored or outputted for each segment.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.