Question

In this answer, somebody writes:

[..] most compilers won't optimize a + b + c + d to (a + b) + (c + d) (this is an optimization since the second expression can be pipelined better)

The original question was about how certain expressions involving float values can or cannot be re-ordered due to the imprecision of floating-point arithmetic.

I'm more interested in the quoted part, though: why, say with unsigned int values, would it be easier to generate code that exploits CPU pipelines if a+b+c+d is rewritten as (a+b)+(c+d)?

Solution

a+b and c+d can be calculated in parallel.

Like this:

x = a+b
y = c+d
return x+y // requires x and y

vs

x = a+b
y = x+c // requires x
return y+d // requires y (and thus x)

When calculating y, one has to wait for the result of x to come in first; there is a data dependency between them. See Instruction-level parallelism on Wikipedia.
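
To make the dependency chains concrete, here is a minimal C sketch of the two evaluation orders (the function names are mine, purely for illustration):

unsigned int sum_chain(unsigned int a, unsigned int b,
                       unsigned int c, unsigned int d)
{
    unsigned int x = a + b;   /* first add                    */
    unsigned int y = x + c;   /* must wait for x              */
    return y + d;             /* must wait for y (and thus x) */
}

unsigned int sum_tree(unsigned int a, unsigned int b,
                      unsigned int c, unsigned int d)
{
    unsigned int x = a + b;   /* independent of c and d       */
    unsigned int y = c + d;   /* can run in parallel with x   */
    return x + y;             /* waits only for x and y       */
}

If each addition takes one cycle, sum_chain has a critical path of three dependent additions, while the longest path in sum_tree is only two.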

OTHER TIPS

With unsigned int? It wouldn't. Integer additions can be reordered freely without any risk of affecting the result, so any half-decent compiler should generate the same code for both expressions; the two groupings only mean something different when the operands are floats.
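
As a small side illustration (my own example, not from the answer): with floats the two groupings really can give different results, which is why the compiler must not reassociate them by default.

#include <stdio.h>

int main(void)
{
    float a = 1e20f, b = -1e20f, c = 3.14f;

    float left  = (a + b) + c;  /* the huge terms cancel first: about 3.14 */
    float right = a + (b + c);  /* 3.14 is absorbed by -1e20: result 0     */

    printf("%g %g\n", left, right);  /* prints two different values */
    return 0;
}

With unsigned int operands, by contrast, both groupings are guaranteed to produce the same (wrapped) sum, so the compiler is free to pick whichever shape it likes.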

If your compiler generates an SSA intermediate representation, it might come out looking like:

AB = a + b;
ABC = AB + c;
ABCD = ABC + d;

in the first case, and:

AB = a + b;
CD = c + d;
ABCD = AB + CD;

In the first case, each term depends on the previous one, so even if the CPU is capable of adding multiple terms at once, it has to wait for the result of the previous operation before starting the next. In the second case, a processor like a modern x86 with multiple ALU pipelines can calculate AB and CD independently and at the same time.
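
The same idea carries over to longer sums. The sketch below is mine (not part of the original answer): it splits an array reduction across two accumulators, so each accumulator forms its own short dependency chain and a CPU with more than one ALU pipeline can keep two additions in flight per iteration.

#include <stddef.h>

unsigned int sum_array(const unsigned int *v, size_t n)
{
    unsigned int s0 = 0, s1 = 0;
    size_t i = 0;

    for (; i + 1 < n; i += 2) {
        s0 += v[i];      /* depends only on the previous s0 */
        s1 += v[i + 1];  /* depends only on the previous s1 */
    }
    if (i < n)
        s0 += v[i];      /* leftover element when n is odd  */

    return s0 + s1;      /* combine partial sums, like (a+b)+(c+d) */
}

For unsigned int an optimizing compiler may well perform this kind of transformation itself; for floats it generally will not unless it is explicitly allowed to reassociate (for example with GCC's or Clang's -ffast-math).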
