Question

Ok, first of all, I don't want any kind of flame war here or anything like that. My biggest question is mostly theoretical and will include only a few examples.

So, as I wrote, I can't understand how an interpreted language can be even slightly efficient. And since it's modern, I'll take Java as an example.

Let's go back to the days when there were no JIT compilers. Java has its virtual machine, which is essentially its hardware. You write code, which gets compiled to bytecode to take at least some work off the virtual machine, fine. But considering how complex even RISC instructions are in hardware, I can't even imagine how to do that on hardware emulated in software.

I have no experience writing virtual machines, so I don't know how it's done at the most efficient level, but I can't think of anything more efficient than testing every instruction for a match and then doing the appropriate action. You know, something like: if(instruction=="something") { (do it) } else if(instruction=="something_different"){ (do it) } etc....

But that has to be terribly slow. And still, even though there are articles saying Java was slow before JIT compilers, they still say it wasn't that slow. But with emulation it must take many clock cycles of real hardware to perform one bytecode instruction.

And yet, entire platforms are based on Java. For example, Android. And the first versions of Android had no JIT compiler; they were interpreted. But shouldn't Android then be terribly slow? And yet it isn't. I know that when you call an API function from the Android library, it's written in machine code, so it's efficient, and that helps a lot.

But imagine you wrote your own game engine from scratch, using the API just to display images. You would need lots of array copy operations and lots of computation, which would be terribly slow when emulated.

And now some examples, as I promised. Since I mostly work with MCUs, I found a JVM for the ATMEL AVR MCU. They state that an 8 MHz MCU can do 20k Java opcodes per second. But since the AVR can do most instructions in one or two cycles, let's say 6,000,000 native instructions per second on average. That means a JVM without a JIT compiler is 300 times slower than machine code. So why did Java become so popular without a JIT compiler? Isn't that too big a performance loss? I just can't understand it. Thanks.


The solution

We've had bytecode around for a long time. On the old Apple II, the UCSD p-System was very popular; it compiled Pascal into bytecode, which would be interpreted by an 8-bit 6502 that might be running at 2 MHz. Those programs did run reasonably fast.

A bytecode interpreter would generally be based on a jump table rather than a chain of if/then/else statements. In C or C++, this would involve a switch statement. Fundamentally, the interpreter has the equivalent of an array of handler routines, and uses the opcode from the bytecode instruction as the index into that array.
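
For illustration only, here is a minimal sketch of that idea in C, with made-up opcodes (OP_PUSH, OP_ADD, OP_HALT) rather than real JVM ones: each iteration fetches one opcode and the switch dispatches it to the matching handler.

    #include <stdint.h>

    enum { OP_PUSH = 0, OP_ADD = 1, OP_HALT = 2 };   /* made-up opcodes */

    static int32_t stack[256];

    void run(const uint8_t *code)
    {
        int sp = 0;                          /* operand stack pointer */
        for (const uint8_t *pc = code;;) {
            switch (*pc++) {                 /* fetch and dispatch one opcode */
            case OP_PUSH:
                stack[sp++] = *pc++;         /* push the next byte as a value */
                break;
            case OP_ADD:
                sp--;
                stack[sp - 1] += stack[sp];  /* pop two values, push their sum */
                break;
            case OP_HALT:
                return;
            }
        }
    }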

It's also possible to have bytecode that's higher-level than the machine instructions, so that one bytecode instruction translates into several, sometimes numerous, machine code instructions. A bytecode that was designed for a particular language can do this fairly easily, as it only has to match the control and data structures of that particular language. This spreads the interpretation overhead over more useful work and makes the interpreter more efficient.
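
As a rough, hypothetical example (there is no such single opcode in the real JVM instruction set), a high-level "copy this array" instruction could hand the whole job to native code, so one dispatch pays for thousands of machine operations:

    #include <stdint.h>
    #include <string.h>

    /* Hypothetical handler for a high-level "array copy" opcode: one interpreted
       instruction, but len * sizeof(int32_t) bytes are moved by native code. */
    void op_array_copy(int32_t *dst, const int32_t *src, uint32_t len)
    {
        memcpy(dst, src, (size_t)len * sizeof *dst);
    }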

An interpreted language is likely to have some speed penalty when compared to a compiled language, but this is often unimportant. Many programs process input and output at human speed, and that leaves a tremendous amount of performance that can be wasted. Even a network-bound program is likely to have far more CPU power available than it needs. There are programs that can use all the CPU performance they can get, and for obvious reasons they tend not to be written in interpreted languages.

And, of course, there's the question of what you get for some inefficiency that may or may not make a difference. Interpreted language implementations tend to be easier to port than compiled implementations, and the actual byte code is often portable. It can be easier to put higher-level functionality in the language. It allows the compilation step to be much shorter, meaning that execution can start much faster. It may allow better diagnostics if something goes wrong.

Other tips

But shouldn't Android then be terribly slow?

Define "terribly slow". It's a phone. It has to process "Dial first digit" before you dial the second digit.

In any interactive application, the limiting factor is always human reaction time. It could be 100 times slower and still be faster than the user.

So, to answer your question: yes, interpreters are slow, but they are usually fast enough, particularly as hardware keeps getting faster.

Remember that when Java was introduced, it was sold as a web applet language (replacing, and now replaced by, JavaScript, which is also interpreted). It was only after JIT compilation that it became popular on servers.

Bytecode interpreters can be faster than a chain of if()s by using a jump table:

    /* jmp_tbl is an array of 256 function pointers: one handler per opcode value */
    void (*jmp_tbl[256])(void) = { /* ... */ };
    uint8_t op = *program_counter++;    /* fetch the next opcode from the bytecode */
    jmp_tbl[op]();                      /* single indexed, indirect call */
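
Each entry of jmp_tbl is the handler for one opcode value, so dispatching an instruction costs one table lookup and one indirect call instead of a series of comparisons; this is essentially the code a compiler will typically generate for the dense switch statement mentioned earlier.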

There are two different ways to approach this question.

(i) "why is it OK to run slow code"

As James already mentioned above, sometimes speed of execution is not all you're interested in. For lots of apps, running in interpreted mode can be "fast enough". You have to take into account how the code you're writing will be used.

(ii) "why is interpreted code inneficient"

There are many ways you can implement an interpreter. In your question you talk about the most naïve approach: basically a big switch, interpreting each JVM instruction as it's read.

But you can optimize that: for example, instead of looking at a single JVM instruction, you can look at a sequence of them and look for patterns for which you have more efficient interpretations available. Sun's JVM actually does some of these optimizations in the interpreter itself. In a previous job, a guy took some time to optimize the interpreter that way and interpreted Java bytecode was running noticeably faster after his changes.
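
As a sketch of that kind of optimization (the handler and surrounding state below are hypothetical, not Sun's actual code), an interpreter can peephole-match a common pair of bytecodes, here iload followed by iadd, and run one fused handler instead of dispatching twice:

    #include <stdint.h>

    enum { OP_ILOAD = 0x15, OP_IADD = 0x60 };   /* the real iload / iadd opcode values */

    extern int32_t locals[];    /* local variable slots of the current frame */
    extern int32_t stack[];     /* operand stack */
    extern int sp;              /* operand stack pointer */

    /* Hypothetical fused handler: run when the dispatcher has consumed an iload
       opcode and peeked ahead to see that an iadd follows. The net stack effect
       of iload+iadd is "add a local to the value on top of the stack", so one
       handler (and one dispatch) does the work of two. Returns the updated pc. */
    const uint8_t *do_iload_iadd(const uint8_t *pc)
    {
        uint8_t index = *pc++;          /* operand of iload: which local to load */
        pc++;                           /* skip over the fused iadd opcode */
        stack[sp - 1] += locals[index];
        return pc;
    }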

But in modern JVMs that contain a JIT compiler, the interpreter is just a stepping stone until the JIT does its job, so people don't really spend that much time optimizing the interpreter.

12 MHz would be an ATtiny, which is an 8-bit microcontroller. That means (for example) that a native 'Add' instruction can only add two 8-bit numbers together to get a 9-bit result. The JVM is basically a virtual 32-bit processor. That means its add instruction adds two 32-bit numbers together to produce a 33-bit result.

As such, when you're comparing instruction rates, you should expect a 4:1 reduction in instruction rate as an absolute minimum. In reality, while it's easy to simulate a 32-bit add with 4 8-bit adds (with carries), some things don't scale quite like that. Just for example, according to Atmel's own app note, a 16x16 multiplication producing a 32-bit result executes in ~218 clock cycles. The same app note shows a 16/16 bit division (producing an 8-bit result) running in 255 cycles.
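
To make the 4:1 figure concrete, here is a minimal sketch in plain C (standing in for what an 8-bit CPU has to do, not actual AVR assembly) of a 32-bit add built out of 8-bit adds with explicit carry propagation:

    #include <stdint.h>

    /* Add two 32-bit values using only 8-bit adds plus an explicit carry --
       roughly the work an 8-bit CPU must do for one 32-bit "add" operation. */
    uint32_t add32_via_8bit_adds(uint32_t a, uint32_t b)
    {
        uint32_t result = 0;
        uint8_t carry = 0;
        for (int byte = 0; byte < 4; byte++) {
            uint8_t x = (uint8_t)(a >> (8 * byte));
            uint8_t y = (uint8_t)(b >> (8 * byte));
            uint16_t sum = (uint16_t)x + y + carry;   /* 8-bit add with carry in */
            carry = (uint8_t)(sum >> 8);              /* carry out to the next byte */
            result |= (uint32_t)(uint8_t)sum << (8 * byte);
        }
        return result;
    }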

Assuming those scale linearly, we can expect 32-bit versions of the multiplication to take ~425-450 clock cycles, and the division ~510 cycles. In reality, we should probably expect a bit of overhead, which would reduce speed still more -- adding at least 10% to those estimates probably makes them more realistic.

Bottom line: when you compare apples to apples, it becomes apparent that a whole lot of the speed difference you're talking about isn't real at all (or isn't attributable to JVM overhead, anyway).

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow