Your information is very deep and detailed.
I tried with your suggestion to set ""-mca
plm_rsh_assume_same_shell 0", it still does not work though. My
situation is that we start a c-shell script from bash shell, which in
turn invokes mpirun to other slave nodes. These slave nodes have
bash login shell by default, and mpirun will execute another c-shell
script on each node, will these mess thing up a little bit and related
to the orted missing message?
On Jun 28, 2011, at 3:52 PM, yanyg_at_[hidden] wrote:
I looked a little deeper into this. I keep forgetting that we changed
our default settings a few years ago. In the dim past, OMPI would
always probe the remote node to find out what shell it was using,
and then use the proper command syntax for that shell. However,
people complained about the extra time during launch, and very
very few people actually used mis-matched shells.
So we changed the setting the other way to default to assuming the
remote shell is the same as the local one. For those like yourself
that actually do have a mismatch, we left a parameter you can set
to override that assumption. Just add "-mca
plm_rsh_assume_same_shell 0" to your mpirun cmd line and it
should resolve the problem.