Patent · US Active

Processing data from multiple sources

US10642850B2 · kind B2 · utility

0Cited by
15References
43Claims
0Family size

Assignee

Inventors

Key dates

Filing dateFeb 14, 2017
Grant dateMay 5, 2020
Priority date
Expiry dateJun 24, 2038

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F16/284
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

In a first aspect, a method includes, at a node of a Hadoop cluster, the node storing a first portion of data in HDFS data storage, executing a first instance of a data processing engine capable of receiving data from a data source external to the Hadoop cluster, receiving a computer-executable program by the data processing engine, executing at least part of the program by the first instance of the data processing engine, receiving, by the data processing engine, a second portion of data from the external data source, storing the second portion of data other than in HDFS storage, and performing, by the data processing engine, a data processing operation identified by the program using at least the first portion of data and the second portion of data.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.