You are here: Cluster » MPI

MPI

In a few cases users experienced random-like application crashes. In these cases usage of the extra parameter "--mca orte_base_help_aggregate 0 --verbose" for the actual mpirun or mpiexec call enforces printing an error message like this: "=>> PBS: job killed: mem job total 276564 kb exceeded limit 204800 kb". It might appear that pbs will not give you the standard memory error message but the one mentioned before. Be aware of the differences of pmem (memory per cpu) and mem (total memory of the application).

In general the ompi_info command is an useful tool in case something does not work how it should.

-- Cluster.salzmann - 01 Nov 2013
This site is powered by FoswikiCopyright © by the contributing authors. All material on this collaboration platform is the property of the contributing authors.
Ideas, requests, problems regarding Foswiki? Send feedback