Patent · US Active

Distributed system for large volume deep web data extraction

US10210255B2 · kind B2 · utility

29Cited by
2References
3Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 31, 2015
Grant dateFeb 19, 2019
Priority date
Expiry dateOct 11, 2036

Classification

  • Technology area (CPC H)Electricity
  • CPC primaryH04L67/02
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A distributed system for large volume deep web data extraction that is extremely scalable, allows multiple heterogeneous concurrent searches, has power web scrape result processing capabilities and uses a well defined, highly customizable, simplified, search agent configuration interface requiring minimal specialized programming knowledge. A scrape campaign control module receives scrape control and web spider configuration parameters through either a command line interface of an HTTP based application programming interface. The control module uses those parameters to have an arbitrary plurality of web spiders created and deployed by a plurality of servers. Scrape campaign results are presented as prescribed.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.