Question

I'm seeing an MPI_ERR_TRUNCATE error with boost::mpi when performing multiple isend/irecv transfers with the same tag using serialized data. These are not concurrent transfers, i.e. no threading is involved. There is just more than one transfer outstanding at the same time. Here's a short test program that exhibits the failure:

#include <iostream>
#include <string>
#include <vector>
#include <boost/mpi.hpp>
#include <boost/serialization/string.hpp>

static const size_t N = 2;

int main() {
   boost::mpi::environment env;
   boost::mpi::communicator world;

#if 1
   // Serialized types fail.
   typedef std::string DataType;
#define SEND_VALUE "how now brown cow"
#else
   // Native MPI types succeed.
   typedef int DataType;
#define SEND_VALUE 42
#endif

   DataType out(SEND_VALUE);
   std::vector<DataType> in(N);
   std::vector<boost::mpi::request> sends;
   std::vector<boost::mpi::request> recvs;
   sends.reserve(N);
   recvs.reserve(N);

   std::cout << "Multiple transfers with different tags\n";
   sends.clear();
   recvs.clear();
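   // Post N outstanding self-send/receive pairs to rank 0, one tag per pair.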
   for (size_t i = 0; i < N; ++i) {
      sends.push_back(world.isend(0, i, out));
      recvs.push_back(world.irecv(0, i, in[i]));
   }
   boost::mpi::wait_all(sends.begin(), sends.end());
   boost::mpi::wait_all(recvs.begin(), recvs.end());

   std::cout << "Multiple transfers with same tags\n";
   sends.clear();
   recvs.clear();
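   // Post N outstanding self-send/receive pairs to rank 0, all on tag 0;
   // with the serialized DataType this is the case that fails.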
   for (size_t i = 0; i < N; ++i) {
      sends.push_back(world.isend(0, 0, out));
      recvs.push_back(world.irecv(0, 0, in[i]));
   }
   boost::mpi::wait_all(sends.begin(), sends.end());
   boost::mpi::wait_all(recvs.begin(), recvs.end());

   return 0;
}

In this program I first do two transfers on different tags, which work fine. Then I attempt two transfers on the same tag, which fail with:

libc++abi.dylib: terminating with uncaught exception of type boost::exception_detail::clone_impl<boost::exception_detail::error_info_injector<boost::mpi::exception> >: MPI_Unpack: MPI_ERR_TRUNCATE: message truncated

If I use a native MPI data type so that serialization is not invoked, things seem to work. I get the same error on MacPorts boost 1.55 with OpenMPI 1.7.3, and on Debian boost 1.49 with OpenMPI 1.4.5. I also tried multiple transfers with the same tag directly through the MPI C interface and that appeared to work (a sketch of that kind of test is below), though of course that way I can only transfer native MPI data types.
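
For reference, here is a minimal sketch of that kind of test against the plain MPI C interface (illustrative only; the names and structure are not taken verbatim from my test):

#include <mpi.h>
#include <cstdio>

int main(int argc, char** argv) {
   MPI_Init(&argc, &argv);

   // Run with a single process: rank 0 sends to and receives from itself.
   int out[2] = {42, 42};
   int in[2]  = {0, 0};
   MPI_Request sends[2], recvs[2];

   // Two sends and two receives outstanding at once, all on tag 0.
   for (int i = 0; i < 2; ++i) {
      MPI_Isend(&out[i], 1, MPI_INT, 0, 0, MPI_COMM_WORLD, &sends[i]);
      MPI_Irecv(&in[i], 1, MPI_INT, 0, 0, MPI_COMM_WORLD, &recvs[i]);
   }
   MPI_Waitall(2, sends, MPI_STATUSES_IGNORE);
   MPI_Waitall(2, recvs, MPI_STATUSES_IGNORE);

   std::printf("received %d %d\n", in[0], in[1]);
   MPI_Finalize();
   return 0;
}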

My question is whether having multiple outstanding transfers on the same tag is a valid operation with boost::mpi, and if so is there a bug in my program or a bug in boost::mpi?

Solution

As of the current version of Boost, 1.55, boost::mpi does not guarantee non-overtaking messages. This is in contrast to the underlying MPI standard, which does:

Order: Messages are non-overtaking: If a sender sends two messages in succession to the same destination, and both match the same receive, then this operation cannot receive the second message if the first one is still pending. If a receiver posts two receives in succession, and both match the same message, then the second receive operation cannot be satisfied by this message, if the first one is still pending. This requirement facilitates matching of sends to receives. It guarantees that message-passing code is deterministic, if processes are single-threaded and the wildcard MPI_ANY_SOURCE is not used in receives.

The reason boost::mpi does not guarantee non-overtaking is that serialized data types are transferred in two MPI messages, one for the size and one for the payload, and the irecv for the second message cannot be posted until the first message has been examined.
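
To illustrate, here is a simplified sketch of the shape of that two-message exchange, written against the plain MPI interface; this is only an illustration of the idea, not boost::mpi's actual implementation:

#include <mpi.h>
#include <cstdio>
#include <cstring>
#include <vector>

int main(int argc, char** argv) {
   MPI_Init(&argc, &argv);

   // Run with a single process; one "serialized-style" self-transfer on tag 0.
   char payload[] = "how now brown cow";
   int size = static_cast<int>(std::strlen(payload)) + 1;
   const int tag = 0;

   // Sender side: first the size, then the payload, both on the same tag.
   MPI_Request sends[2];
   MPI_Isend(&size, 1, MPI_INT, 0, tag, MPI_COMM_WORLD, &sends[0]);
   MPI_Isend(payload, size, MPI_CHAR, 0, tag, MPI_COMM_WORLD, &sends[1]);

   // Receiver side: the payload receive cannot be posted until the size
   // message has been received, because only then is the buffer size known.
   int incoming = 0;
   MPI_Recv(&incoming, 1, MPI_INT, 0, tag, MPI_COMM_WORLD, MPI_STATUS_IGNORE);
   std::vector<char> buffer(incoming);
   MPI_Recv(&buffer[0], incoming, MPI_CHAR, 0, tag, MPI_COMM_WORLD, MPI_STATUS_IGNORE);

   MPI_Waitall(2, sends, MPI_STATUSES_IGNORE);
   std::printf("received: %s\n", &buffer[0]);

   // With two such transfers outstanding on the same tag, the size receive of
   // one transfer can be matched by the payload message of the other, and
   // unpacking a message larger than the posted buffer is reported as
   // MPI_ERR_TRUNCATE.
   MPI_Finalize();
   return 0;
}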

A proposal to guarantee non-overtaking in boost::mpi is being considered. Further discussion can be found on the boost::mpi mailing list.

OTHER TIPS

The problem could be that you're waiting for all of your sends to complete before waiting on any of your receives. MPI expects your sends and receives to match in time as well as in number: you can't complete all of your send calls without the matching receive calls also making progress.

The way MPI usually handles sending a message is that when you call send, it returns as soon as the message has been handled by the library. That could mean the message has been copied to an internal buffer, or that it was actually transferred to the remote process and received. Either way, the message has to go somewhere. If you don't have a receive buffer already waiting, the message has to be buffered internally. Eventually the implementation runs out of those buffers and starts to do bad things (like return errors to the user), which is probably what you're seeing here.

The solution is to pre-post your receive buffers. In your case, you can just push all of your isend and irecv requests into the same vector and let MPI handle everything, as in the sketch below. That will give MPI access to all of the receive buffers so your messages have somewhere to go.
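
A minimal sketch of that arrangement, applied to a trimmed version of the program in the question (the serialized std::string case, all transfers on tag 0):

#include <string>
#include <vector>
#include <boost/mpi.hpp>
#include <boost/serialization/string.hpp>

int main() {
   boost::mpi::environment env;
   boost::mpi::communicator world;

   const int N = 2;
   std::string out("how now brown cow");
   std::vector<std::string> in(N);

   // Keep all requests in one container so every receive buffer is posted
   // up front and every request is progressed and completed together.
   std::vector<boost::mpi::request> reqs;
   reqs.reserve(2 * N);
   for (int i = 0; i < N; ++i) {
      reqs.push_back(world.isend(0, 0, out));
      reqs.push_back(world.irecv(0, 0, in[i]));
   }
   boost::mpi::wait_all(reqs.begin(), reqs.end());
   return 0;
}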

Licensed under: CC-BY-SA with attribution