any mechanism in Java to provide compile-time code variants?

Question 1

I think you have a few options:

Put a test in on a static final constant (e.g. a boolean), which conditionally executes the the visual interface code. The JVM will eliminate the dead code when you set the constant to "off", and your code will be fully optimised. The downside is, of course, that you can only switch this at compile time, but this may be fine if you actually want to build two copies of the library.
Add an extra parameter to the function that determines whether the visual interface is called, and test this where necessary in your algorithm. This will add a small amount of runtime overhead, but may well be acceptable. I suggest you benchmark this, in my experience though such tests on a local variable are usually sufficiently cheap that you can get away with it (in CPU terms, will probably be just a register test which is likely to be even cheaper than the cost of a single memory access into an int[] array...)
Use a higher level / meta-language to express the algorithms, and use code-generation techniques to generate the actual code you need (with or without). I've done stuff like this in Clojure, for example. It's also an option to generate bytecode directly with tools like ASM (if all you care about is execution, and don't need the Java source code).
Use a textual pre-processor. It should work fine for Java code generation (as it does for C/C++), though it's not such a common approach and you may find it a bit fragile. You may need to do some clever stuff like generating different class names in the file system etc.

Question 2

Contrary to e.g. C++, the final compilation to native machine code is done at runtime, in this case completely eliminating the need for building two separate versions for performance reasons.

If you pass the boolean to enable/disable the extra calls as a parameter to the constructor of the class that implements your algorithm and store it in a final class variable (i.e. a constant), when the algorithm gets executed in a tight loop (= a 'hot spot') the Hotspot VM will compile the class instance and remove the dead code. This kind of runtime optimizations can't be done with C++.

But note that a boolean test is likely to cost only a very small fraction of the total algorithm.

EDIT: your tests have shown that this doesn't work, although I'm not sure they are done correctly. You are not using any benchmarking framework. The most aggressive optimizations will happen with the server VM (-server) and then code must be properly warmed up first (the first 10000 or so iterations will happen uncompiled which is of course much slower). Also, using a template pattern may have better chance of being optimized than a final boolean, as the boolean check is cheap anyway and the compiler is known to do virtual call inlining (as far as I know).

EDIT2: if you don't need to switch at runtime (after all conditional compilation and separate builds won't help you there either) just go with the static final boolean which you know gets optimized. Initialize it with the value from a command line argument or config file and you can easily switch between both versions at application start time.

Question 3

If you do:

private static final ENABLED = false;

// Then later...
if(ENABLED){
    call();
}

The whole if-block will not be even included in the generated bytecode (at least on newer JVMs). Would that be an option?

Question 4

The reason why Java doesn't have a preprocessor is to prevent programmers from doing exactly what you are trying to do. Instead, you should write Java code to implement the functionality you would otherwise implement in the preprocessor directives and let the compiler optimize the generated code. This way, you will end up with code written in a single language (instead of two languages, Java and preprocessor DSL), making things a lot easier for the toolchain you use when analyzing your code.

One way to solve your problem in pure Java code is to use the Template method pattern.

Question 5

As this guy claims to prove, it is possible to modify the bytecode to set a final variable on runtime. I guess you could use the same aproach to set static final VISUALIZE on and off.