Why is this behavior allowed in the Java Memory Model?

Question 1

As explained, the only values ever written to x are 0 and 42. Thread 1:

r3 = x; // here we read either 0 or 42
if (r3 == 0)
  x = 42;  
// at this point x is definitely 42
r1 = x;

Therefore the JIT compiler can rewrite r1 = x as r1 = 42, and in turn rewrite the subsequent write to y as y = 42. The point is that Thread 1 will always, unconditionally, write 42 to y. The r3 variable is in fact redundant and could be eliminated from the machine code entirely. So the code in the example only gives the appearance of a causal arrow from x to y; detailed analysis shows that there is in fact no causality. The surprising consequence is that the write to y can be committed early.
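To make that analysis concrete, here is a minimal sketch that runs Thread 1's logic in isolation for both values x can hold. The class and method names are hypothetical, and a local variable stands in for the shared x; the only point is that the value destined for y comes out the same either way.

```java
class Thread1Analysis {
    // Simulates Thread 1 running alone, with x starting at initialX,
    // and returns the value it would go on to write to y.
    static int valueWrittenToY(int initialX) {
        int x = initialX;   // local stand-in for the shared variable x
        int r3 = x;         // reads either 0 or 42
        if (r3 == 0) {
            x = 42;
        }
        int r1 = x;         // x is 42 on both paths
        return r1;          // the example then performs y = r1
    }

    public static void main(String[] args) {
        for (int initialX : new int[] {0, 42}) {
            System.out.println("x=" + initialX + " -> y=" + valueWrittenToY(initialX));
        }
    }
}
```

Both iterations print y=42, which is why the compiler is free to treat the write to y as a constant.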

A general note on optimization: I take it you are familiar with the performance penalty of reading from main memory. That is why the JIT compiler avoids such reads whenever it can, and in this example it turns out that it does not need to read x at all in order to know what to write to y.

A general note on notation: r1, r2, r3 are local variables (they could be on the stack or in CPU registers); x, y are shared variables (these are in the main memory). Without taking this into account, the examples will not make sense.

Question 2

The compiler can perform some analysis and optimization and end up with the following code for Thread 1:

y=42; // step 1
r3=x; // step 2
x=42; // step 3

For single-threaded execution, this code is equivalent to the original code and so is legal. If Thread 2 then runs between step 1 and step 2 (which is entirely possible), r3 is assigned 42 as well.
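That interleaving can be replayed deterministically in a single thread. This is only a sketch: it assumes Thread 2 consists of r2 = y; x = r2; (as in the JLS example the question refers to; Thread 2's code is not shown above), and the class name and the use of static fields for r2 and r3 are illustrative choices.

```java
class ReorderedInterleaving {
    static int x, y;    // shared variables, both initially 0
    static int r2, r3;  // thread-local in the example; fields here so they can be inspected

    // Replays one specific schedule: Thread 2 runs between step 1 and step 2.
    static void run() {
        x = 0;
        y = 0;
        y = 42;   // Thread 1, step 1 (write to y moved up by the compiler)
        r2 = y;   // Thread 2 (assumed): reads 42
        x = r2;   // Thread 2 (assumed): writes 42 to x
        r3 = x;   // Thread 1, step 2: now reads 42
        x = 42;   // Thread 1, step 3
    }

    public static void main(String[] args) {
        run();
        System.out.println("r2=" + r2 + " r3=" + r3);  // prints "r2=42 r3=42"
    }
}
```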

The whole point of this code sample is to demonstrate the need for proper synchronization.
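As one possible illustration of what proper synchronization changes, the sketch below declares x and y volatile. This is my own variant, not part of the original example, and Thread 2's body (r2 = y; x = r2;) is again assumed from the JLS example. With both variables volatile the program has no data races, so only sequentially consistent executions are allowed, and in every such execution Thread 1's first read of x happens before anything could have written 42 to it.

```java
class VolatileVariant {
    // Assumption: making both shared variables volatile removes the data race.
    static volatile int x, y;

    // Runs both threads once and returns the value Thread 1 read into r3.
    static int trial() {
        x = 0;
        y = 0;
        int[] seen = new int[1];  // carries r3 out of the lambda
        Thread t1 = new Thread(() -> {
            int r3 = x;
            if (r3 == 0) {
                x = 42;
            }
            int r1 = x;
            y = r1;
            seen[0] = r3;
        });
        Thread t2 = new Thread(() -> {
            int r2 = y;
            x = r2;
        });
        t1.start();
        t2.start();
        try {
            t1.join();  // join() makes the write to seen[0] visible here
            t2.join();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
        return seen[0];
    }

    public static void main(String[] args) {
        for (int i = 0; i < 200; i++) {
            if (trial() != 0) {
                throw new AssertionError("r3 was not 0");
            }
        }
        System.out.println("r3 was 0 in every trial");
    }
}
```

Note that a passing run of the plain (non-volatile) version would prove nothing, since the surprising result is merely permitted, not guaranteed to show up on any particular JVM or CPU.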

Question 3

It is worth noting that javac doesn't optimise the code to any significant degree. The JIT optimises the code but is fairly conservative about re-ordering it. The CPU can also re-order execution, and it does so quite a lot, though only to a small degree each time.

Forcing the CPU not to do instruction-level optimisation is fairly expensive, e.g. it can slow it down by a factor of 10 or more. AFAIK, the Java designers wanted to specify the minimum set of guarantees that could still be implemented efficiently on most CPUs.