Patent · US Active

Double map reduce distributed computing framework

US8321454B2 · kind B2 · utility

20Cited by
8References
30Claims
0Family size

Assignee

Inventors

Key dates

Filing dateSep 14, 2010
Grant dateNov 27, 2012
Priority date
Expiry dateMay 17, 2031

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F9/5066
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method, apparatus, system, article of manufacture, and data structure provide the ability to perform a sorted map-reduce job on a cluster. A cluster of two or more computers is defined by installing a map-reduce framework onto each computer and formatting the cluster by identifying the cluster computers, establishing communication between them, and enabling the cluster to function as a unit. Data is placed into the cluster where it is distributed so that each computer contains a portion of the data. A first map function is performed where each computer sorts their respective data and creates an abstraction that is a representation of the data. The abstractions are exchanged and merged to create complete abstraction. A second map function searches the complete abstraction to redistribute and exchange the data across the computers in the cluster. A reduce function is performed in parallel to produce a result.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.