What does CUDA guarantee on CC 3.x: are all threads of a warp always synchronized, or only those of a half-warp?

Question

The concept of half-warp applies to devices of cc1.x only. Perhaps you are simply referring to a part of a warp.
There is in fact no guarantee in the CUDA programming model that the threads of a warp will be executed in lockstep. However, all extant implementations currently do this, so there is a considerable code base that relies on warp-synchronous behavior.
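If you would rather not depend on implicit lockstep, here is a minimal sketch of a warp-level sum that names the participating lanes explicitly. It assumes a CUDA 9 or later toolkit (for `__shfl_down_sync`), a cc3.0+ device, and a single block of exactly 32 threads; the names `warp_sum` and `run_warp_sum` are made up for illustration.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel: sums 32 values held by the 32 lanes of one warp.
__global__ void warp_sum(const int *in, int *out)
{
    int v = in[threadIdx.x];

    // Each step passes a full-warp mask, so the exchange is defined by the
    // intrinsic itself rather than by an assumption of lockstep execution.
    for (int offset = warpSize / 2; offset > 0; offset /= 2)
        v += __shfl_down_sync(0xffffffffu, v, offset);

    // After the loop, lane 0 holds the sum of all 32 inputs.
    if (threadIdx.x == 0)
        *out = v;
}

// Hypothetical host helper: launches one warp over the values 1..32.
int run_warp_sum()
{
    const int n = 32;
    int h_in[n];
    for (int i = 0; i < n; ++i)
        h_in[i] = i + 1;

    int *d_in = nullptr, *d_out = nullptr;
    cudaMalloc(&d_in, n * sizeof(int));
    cudaMalloc(&d_out, sizeof(int));
    cudaMemcpy(d_in, h_in, n * sizeof(int), cudaMemcpyHostToDevice);

    warp_sum<<<1, n>>>(d_in, d_out);

    int h_out = 0;
    // A blocking cudaMemcpy on the default stream waits for the kernel.
    cudaMemcpy(&h_out, d_out, sizeof(int), cudaMemcpyDeviceToHost);
    cudaFree(d_in);
    cudaFree(d_out);
    return h_out;
}

int main()
{
    printf("warp sum = %d\n", run_warp_sum());
    return 0;
}
```

Error checking on the runtime calls is omitted to keep the sketch short; real code should check each return value.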
Upon entry into a possibly divergent control structure (e.g. if-then-else), the conditional test is performed for all threads in the warp. If necessary, the warp is then partitioned into the threads that take the then path and the threads that take the else path. The warp first executes one of the two paths; the threads that did not select that path remain idle (perform no operations) while it runs. When that path is complete, the warp is restarted down the other path: the previously idle threads now execute it, while the previously active threads remain idle. This is a general description of the behavior in divergent control flow situations, and it roughly lines up with your paragraph that begins with "Or the second half-Warp threads will be inactive(disabled) ...", but I would not use the term half-warp to describe it, as that term generally has a specific meaning for cc1.x devices.
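To make the partitioning concrete, here is a small sketch of a single warp whose lower 16 lanes take the then path and whose upper 16 lanes take the else path. It assumes a CUDA 9 or later toolkit (for `__ballot_sync`) and a launch of exactly one warp; the kernel and helper names are hypothetical. The ballot records which lanes satisfied the condition, which is the partition described above; the order in which the hardware runs the two paths is not observable from this code.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// Hypothetical kernel: lanes 0..15 take "then", lanes 16..31 take "else".
__global__ void divergent_branch(int *out, unsigned *then_mask)
{
    int lane = threadIdx.x % warpSize;
    bool take_then = lane < 16;

    // Every lane of the warp evaluates the predicate; the ballot returns one
    // bit per lane (bit set = that lane will take the "then" path).
    unsigned mask = __ballot_sync(0xffffffffu, take_then);
    if (lane == 0)
        *then_mask = mask;

    if (take_then) {
        out[threadIdx.x] = 2 * lane;     // "then" path: other lanes are idle
    } else {
        out[threadIdx.x] = lane + 100;   // "else" path: other lanes are idle
    }
}

// Hypothetical host helper: runs one warp and copies the results back.
std::vector<int> run_divergent_branch(unsigned *then_mask_host)
{
    const int n = 32;
    int *d_out = nullptr;
    unsigned *d_mask = nullptr;
    cudaMalloc(&d_out, n * sizeof(int));
    cudaMalloc(&d_mask, sizeof(unsigned));

    divergent_branch<<<1, n>>>(d_out, d_mask);

    std::vector<int> h_out(n);
    // Blocking copies on the default stream wait for the kernel to finish.
    cudaMemcpy(h_out.data(), d_out, n * sizeof(int), cudaMemcpyDeviceToHost);
    cudaMemcpy(then_mask_host, d_mask, sizeof(unsigned), cudaMemcpyDeviceToHost);
    cudaFree(d_out);
    cudaFree(d_mask);
    return h_out;
}

int main()
{
    unsigned mask = 0;
    std::vector<int> out = run_divergent_branch(&mask);
    printf("then-mask = 0x%08x\n", mask);
    printf("out[0] = %d, out[31] = %d\n", out[0], out[31]);
    return 0;
}
```

Both halves produce correct results regardless of which path the warp executes first, which is why code should not be written to depend on that order.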
For devices capable of multiple instruction issue, the instructions issued to the warp will all come from the currently executing path (then or else, not both).