Patent · US Expired

Method and apparatus for ensuring proper functionality of a shared memory, multiprocessor system

US6393590B1 · kind B1 · utility

21Cited by
16References
15Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 22, 1998
Grant dateMay 21, 2002
Priority date
Expiry dateDec 22, 2018

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F11/0724
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

The present invention relates to a method and apparatus for ensuring fault detection and system recovery in a multiprocessor computing system. This system comprises a multitude of processing element modules, input/output processor modules and shared memory modules. Each module within the system includes an identical period sanity timer capable to reset the module once a predetermined limit count is reached. If a global clear signal is not received from the operating system scheduler by all modules prior to the expiry of the sanity timers, a system-wide reset is effected. Each processing element module within the system further includes a watchdog timer capable to reset the module once a predetermined limit count is reached. If a process is not run by the operating system scheduler on the processing element before the expiry of the watchdog timer, effectively clearing the watchdog timer, the processing element is reset and removed from service.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.