Segmented document indexing and search
US6631373B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Feb 29, 2000 |
| Grant date | Oct 7, 2003 |
| Priority date | — |
| Expiry date | Feb 29, 2020 |
Classification
- Technology area (CPC Y): Emerging Cross-Sectional Technologies
- CPC primary: Y10S707/99945
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
There is provided a text search apparatus capable of dividing a structured document such as an HTML document into segments, and presenting segments containing a given search key as the search result, thereby providing a part of the document matching the search condition as the result of search. The document is divided into segments by specified tags, and a level of association with an adjacent segment is calculated. A header is detected by a header tag, and the header information is added to the segment contained in the range of the header. Segments are divided and re-integrated according to the level of association therebetween, and indexes are prepared. A search is executed for two indexes, and the level of matching is calculated by weighting the search results for the indexes, and the search result judged according to such level of matching is stored or outputted for each segment.
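The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation. The tag sets, the function names (`segment_html`, `build_index`, `search`), the term-count scoring, and the weights `w_body` and `w_header` are all assumptions made for illustration, and the association-based division and re-integration of segments is omitted. The sketch divides an HTML document into segments at chosen tags, attaches the text of the most recent header to each segment in its range, builds one index over segment bodies and one over the attached headers, and ranks segments by a weighted sum of hits in the two indexes.

```python
from collections import defaultdict
from html.parser import HTMLParser
import re

# Assumed tag sets; the abstract only says "specified tags" and "a header tag".
SPLIT_TAGS = {"p", "li", "div"}
HEADER_TAGS = {"h1", "h2", "h3", "h4", "h5", "h6"}


class Segmenter(HTMLParser):
    """Splits HTML into segments and attaches the enclosing header text."""

    def __init__(self):
        super().__init__()
        self.segments = []       # each item: {"header": str, "body": str}
        self._header = ""        # text of the most recent header
        self._buf = []           # text collected since the last boundary
        self._in_header = False

    def _flush(self):
        text = " ".join("".join(self._buf).split())
        self._buf = []
        if not text:
            return
        if self._in_header:
            self._header = text  # later segments fall in this header's range
        else:
            self.segments.append({"header": self._header, "body": text})

    def handle_starttag(self, tag, attrs):
        if tag in SPLIT_TAGS or tag in HEADER_TAGS:
            self._flush()
            self._in_header = tag in HEADER_TAGS

    def handle_endtag(self, tag):
        if tag in SPLIT_TAGS or tag in HEADER_TAGS:
            self._flush()
            self._in_header = False

    def handle_data(self, data):
        self._buf.append(data)

    def close(self):
        super().close()
        self._flush()            # emit any trailing text as a final segment


def segment_html(html):
    parser = Segmenter()
    parser.feed(html)
    parser.close()
    return parser.segments


def tokenize(text):
    return re.findall(r"\w+", text.lower())


def build_index(segments, field):
    """Inverted index: term -> {segment id: occurrence count} for one field."""
    index = defaultdict(dict)
    for seg_id, seg in enumerate(segments):
        for term in tokenize(seg[field]):
            index[term][seg_id] = index[term].get(seg_id, 0) + 1
    return index


def search(key, body_index, header_index, w_body=1.0, w_header=2.0):
    """Ranks segments by a weighted sum of hits in the two indexes."""
    scores = defaultdict(float)
    for term in tokenize(key):
        for seg_id, n in body_index.get(term, {}).items():
            scores[seg_id] += w_body * n
        for seg_id, n in header_index.get(term, {}).items():
            scores[seg_id] += w_header * n
    # Highest score first; ties broken by document order.
    return sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
```

In this sketch a search returns `(segment id, score)` pairs, so the caller can present the matching part of the document rather than the whole page. Weighting header hits above body hits is one possible choice; the abstract states only that the results from the two indexes are weighted.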
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.