Am 04.11.2021 um 00:38 schrieb Allin Cottrell:
On Thu, 4 Nov 2021, Sven Schreiber wrote:
> Am 03.11.2021 um 17:32 schrieb Riccardo (Jack) Lucchetti:
>> Could it be that there is an extreme corner case, that occurs with
>> very small probability, that makes some command fail with error under
>> very extreme circumstances?
>
> However, when I run the thing sequentially with the same (high, but not
> that high, actually) number of replications, there is no problem.
> Therefore I believe it must be something more MPI-specific.
Sven, why don't you send us (some of us, offlist if you wish) all
that's needed to produce the problem (if only sometimes)?
There are tools we can use to investigate this, valgrind in particular.
Right, will do that eventually. Another thing that occurred to me is
that the output from within an MPI block is notoriously problematic on
Windows, IIRC. So I will do some testing on Linux to see if I can get
some informative output from debugging statements or so.
thanks
sven