Method of web crawling utilizing crawl numbers
US6638314B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Jun 26, 1998 |
| Grant date | Oct 28, 2003 |
| Priority date | — |
| Expiry date | Jun 26, 2018 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06F16/9538
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A computer based system and method of retrieving information pertaining to electronic documents on a computer network is disclosed. The method includes maintaining a database that associates each electronic document with a corresponding crawl number that indicates the most recent crawl during which a change to the document was detected. During a subsequent crawl, electronic documents that have changed since the previous crawl are retrieved, and selected data is stored in a database. The retrieved document information is marked with a crawl number. During subsequent searches, crawl numbers are used to determine documents that have changed since a specified crawl.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.