Java compiler behaviour during narrowing primitive conversion

Question 1

It's easy enough to test it out. Put the following in Temp.java:

class Temp {
  public static void main(String[] argv) {
    byte b = 27;
    System.out.println(b);
  }
}

Now compile it with your favorite compiler:

$ javac Temp.java

Now dump the bytecode with javap:

 $ javap -c Temp.class
 Compiled from "Temp.java"
  class Temp {
    Temp();                                                                                                                             
      Code:
         0: aload_0
         1: invokespecial #1                  // Method java/lang/Object."<init>":()V
         4: return

    public static void main(java.lang.String[]);                                                                                        
      Code:
         0: bipush        27
         2: istore_1
         3: getstatic     #2                  // Field java/lang/System.out:Ljava/io/PrintStream;                                       
         6: iload_1
         7: invokevirtual #3                  // Method java/io/PrintStream.println:(I)V
        10: return
  }

Now replace 27 with (byte)27 and run again. You'll see there is no difference. In fact the two classfiles will have the same md5sum.

There is no runtime cast in the bytecode because the compiler figured out it wouldn't be needed, and optimized it away.

I believe you're correct that syntactically the line byte b = 27 differs from the line byte b = (byte) 27, but they are semantically the same, because all standard compilers are smart enough to optimize the line into a single bytecode.

Question 2

Before we start, it's important to note that in java, all purely numeric literals are int values.

The key phrase regarding allowable un-cast constants is is representable in the type of the variable. That just means the constant "is in range" of the variable type, so:

for byte -128 to 127
for short -32768 to 32767
for char 0 to 65535

Values "in range" won't "lose information" if cast to the variable type. That explains why in-range constants are allowed.

For values outside of the range, an explicit cast is required because information would be lost if a cast was performed. When a narrowing cast is done, bits outside the scope of the variable type are simply masked off - that's what "losing information" means in these cases.

It's as if an assignment of a constant is like:

byte b = nnn & 0xFF;

If nnn is within range, then the mask will not change the value, so nothing lost, so no problem - compiles OK.

If nnn is out of range, information would be lost, so require an explicit cast to acknowledge the loss.

If you recall that all integer literals are ints, the rules are actually no different than those that apply to assigning an int to narrower variable type, except the compiler allows no cast if it knows the value will "fit",