Patent · US Active

State extrapolation for automated and semi-automated crawling architecture

US10387379B2 · kind B2 · utility

0Cited by
13References
26Claims
0Family size

Assignee

Inventors

Key dates

Filing dateSep 29, 2015
Grant dateAug 20, 2019
Priority date
Expiry dateMar 31, 2037

Classification

  • Technology area (CPC H)Electricity
  • CPC primaryH04L67/02
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A system for automated acquisition of content from an application includes a link extraction controller that receives an identification of a target state of the application directly reachable from an intermediate state and a specification of a user interface element of the intermediate state actuated by a user to arrive at the target state. After navigating to the intermediate state in an executing instance of the application and extracting a tree of user interface widgets, the link extraction controller identifies widget sub-trees that have at least a threshold level of commonality with a reference widget sub-tree that includes the specified user interface element. The link extraction controller adds states, including the target state, reachable by user actuation of the identified widget sub-trees to a state list. A scraper module extracts text and metadata from each of the states in the state list for storage in a data store.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.