State extrapolation for automated and semi-automated crawling architecture
US10387379B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Sep 29, 2015 |
| Grant date | Aug 20, 2019 |
| Priority date | — |
| Expiry date | Mar 31, 2037 |
Classification
- Technology area (CPC H)Electricity
- CPC primaryH04L67/02
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A system for automated acquisition of content from an application includes a link extraction controller that receives an identification of a target state of the application directly reachable from an intermediate state and a specification of a user interface element of the intermediate state actuated by a user to arrive at the target state. After navigating to the intermediate state in an executing instance of the application and extracting a tree of user interface widgets, the link extraction controller identifies widget sub-trees that have at least a threshold level of commonality with a reference widget sub-tree that includes the specified user interface element. The link extraction controller adds states, including the target state, reachable by user actuation of the identified widget sub-trees to a state list. A scraper module extracts text and metadata from each of the states in the state list for storage in a data store.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.