Method for parsing, indexing and searching world-wide-web pages
US5864863A · kind A · utility
Assignee
Inventor
Key dates
| Filing date | Aug 9, 1996 |
| Grant date | Jan 26, 1999 |
| Priority date | — |
| Expiry date | Aug 9, 2016 |
Classification
- Technology area (CPC Y)Emerging Cross-Sectional Technologies
- CPC primaryY10S707/99945
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A system indexes Web pages of the Internet. The pages are stored in computers distributively connected to each other by a communications network. Each page has a unique URL (universal record locator). Some of the pages can include URL links to other pages. A communication interface connected to the Internet is used for fetching a batch of Web pages from the computers in accordance with the URLs and URL links. The URLs are determined by an automated Web browser connected to the communications interface. A parser sequentially partitions the batch of specified pages into indexable words where each word represents an indexable portion of information of a specific page, or the word represents an attribute of one or more portions of the specific page. The parser sequentially assigns locations to the words as they are parsed. The locations indicates the unique occurrences of the word in the Web. The output of the parser is stored in a memory as an index. The index includes one index entry for each unique word. Each index entry also includes one or more location entries indicating where the unique word occurs in the Web. A query module parses a query into terms and operators. The operators…
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.