Patent · US Expired

System and method for preventing data corruption in computer system clusters

US7631066B1 · kind B1 · utility

138Cited by
40References
21Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMar 25, 2002
Grant dateDec 8, 2009
Priority date
Expiry dateJan 16, 2024

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F2209/5014
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Systems, methods, apparatus and software can make use of coordinator resources and SCSI-3 persistent reservation commands to determine which nodes of a cluster should be ejected from the cluster, thereby preventing them from corrupting data on a shared data resource. Fencing software operating on the cluster nodes monitors the cluster for a cluster partition (split-brain) event. When such an event occurs, software on at least two of the nodes attempts to unregister other nodes from a majority of coordinator resources. The node that succeeds in gaining control of the majority of coordinator resources survives. Nodes failing to gain control of a majority of coordinator resources remove themselves from the cluster. The winning node can also proceed to unregister ejected nodes from shared data resources. These operations can be performed in parallel to decrease failover time. The software can continue to execute on all nodes to prevent additional problems should a node erroneously attempt to reenter the cluster.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.