Michael Stumm: Publications

Paper Details

Reference:

Michael Stumm and Songnian Zhou,
"Fault tolerant distributed shared memory",
In Proceedings 2nd IEEE Symposium on Distributed and Parallel Systems (SDPS'90), Dallas, TX, USA, IEEE Computer Society, Washington, DC, USA, December, 1990, pp. 719–724.

Download:

PDF

Abstract:

Distributed shared memory (DSM) has received increased attention as a mechanism for interprocess communication in loosely-coupled distributed systems because of its perceived advantages over direct use of message passing or remote procedure calls. One problem with most DSM algorithms proposed to date, however, is that they do not tolerate faults. The paper extends four basic DSM algorithms to tolerate single host failures and argues that this degree of fault tolerance is sufficient for most applications. It analyzes the performance behavior of the fault tolerant DSM algorithms and shows that for some algorithms the additional overhead for fault tolerance is quite small, but that for other algorithms the extra overhead can be substantial and even unpredictable.

Keywords:

Distributed shared memory, interprocess communication, loosely-coupled distributed systems, single host failures, fault tolerance, performance behavior

Reference Info:

DOI: 10.1109/SPDP.1990.143633
ACMid: 2061653
ISBN: 0-8186-2087-0

BibTeX:

@inproceedings(Stumm-SDPS90,
    author = {Michael Stumm and Songnian Zhou},
    title = {Fault tolerant distributed shared memory},
    booktitle = {Proceedings 2nd IEEE Symposium on Distributed and Parallel Systems (\textbf{SDPS'90})},
    location = {Dallas, TX, USA},
    organization = {IEEE Computer Society},
    address = {Washington, DC, USA},
    month = {December},
    year = {1990},
    pages = {719-724},
    doi = {10.1109/SPDP.1990.143633},
    isbn = {0-8186-2087-0},
    keywords = {Distributed shared memory, interprocess communication, loosely-coupled distributed systems, single host failures, fault tolerance, performance behavior}
)