Discussion:
MPI Reliability
(too old to reply)
hassan jafari
2011-11-05 08:14:01 UTC
Permalink
Hi everybody.
I would to know about how MPI provides reliability in any aspects, if
done? for example checkpoint/restart, and independency from underlying
layer.

Thanks for your attention. please help me.
Georg Bisseling
2012-07-20 17:41:34 UTC
Permalink
The simple answer is that you have to do it yourself.

Reliability features are often asked for but it seems that nobody wants to
pay the cost in additional memory, storage space and cpu/wallclock time.
This is for a simple reason:

It is much easier for an application to save all necessary state on disk
than it is for a library or even the operating system to save all
necessary OS, network and buffered I/O state.
Post by hassan jafari
Hi everybody.
I would to know about how MPI provides reliability in any aspects, if
done? for example checkpoint/restart, and independency from underlying
layer.
Thanks for your attention. please help me.
--
This signature was intentionally left almost blank.
http://www.this-page-intentionally-left-blank.org/
Loading...