It is possible for it to get pre-empted and moved to another CPU after the acquire, but as far as I'm aware, the O/S has to ensure that any explicit memory ordering is preserved (this is probably something it keeps in the thread state). Otherwise, there'd be very little chance of anything running reliably in a multi-cpu environment.
I think the standard assumes that to be the case, on the basis it has to.