If you can, switch to some other C compiler, basically gcc from mingw32. With gcc, you can use this neat trick to replace library functions with your own code, as explained in this answer. Also check whether Visual C can do something similar!
(A side note: if you're stuck with Windows, I personally find it easiest for C development to just grab the Qt SDK for mingw, with Qt Creator included, and use that for a plain C project, which is supported out of the box.)
Alternatively, grab the sources of a C math library, for example the ones from this answer, edit them to include a delay, and link against that instead of the standard math library. With gcc you would simply not use the -lm linker switch and instead link against the custom lib like any other lib. With VC it could be just as simple, but it is possible that the math lib is linked in by default, in which case you need to find out how to disable this default linking.
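To give a rough idea of what "edit them to include a delay" could look like, here is a minimal sketch. Everything in it is made up for illustration: the names slow_sqrt, slow_math_delay, slow_math_delay_iters and slow_math_calls are hypothetical, and the Newton-iteration square root is only a toy stand-in for real library source, which you would edit in the same way (call the delay first, then do the normal computation).

```c
/* Hypothetical tuning knob: busy-loop iterations burned per math call.
   The right value has to be found by calibrating against the target. */
static unsigned long slow_math_delay_iters = 1000;

/* Number of delayed calls so far, handy for sanity-checking the setup. */
static unsigned long slow_math_calls = 0;

/* Burn `iters` loop iterations. The volatile counter keeps the compiler
   from removing the loop even if optimization is accidentally enabled. */
static void slow_math_delay(unsigned long iters)
{
    volatile unsigned long i;
    for (i = 0; i < iters; ++i) {
        /* intentionally empty */
    }
}

/* Toy square root with an artificial delay in front of it.
   Assumption: Newton's method with a fixed iteration count, adequate for
   moderate positive inputs only. Unlike the real sqrt(), it returns 0.0
   for zero and negative inputs instead of handling them properly. */
double slow_sqrt(double x)
{
    double guess;
    int n;

    slow_math_delay(slow_math_delay_iters);
    ++slow_math_calls;

    if (x <= 0.0) {
        return 0.0;
    }

    guess = (x > 1.0) ? x : 1.0;
    for (n = 0; n < 60; ++n) {
        guess = 0.5 * (guess + x / guess);
    }
    return guess;
}
```

Code that would normally call sqrt() would call slow_sqrt() from this custom lib instead, and slow_math_delay_iters is the single place to adjust how expensive each call appears to be.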
Also, make sure to disable any optimizations that may generate custom inlined code for math library functions! Just compiling in debug mode is likely to achieve this, though.
Note that your profiling results will be very rough. You should probably calibrate things a bit, both for the general C optimization settings and for how much delay you need to add to any floating point instructions. But if profiling the real code on the real device is not an option, and this is not just for one project but will be useful in the future too, then I can see some benefit in setting up some kind of comparable environment for native PC code.
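As a very rough sketch of the calibration step, the following measures how long a busy-wait loop takes on the PC and converts a desired per-call delay into a loop count. The helper names (busy_delay, seconds_per_iteration, iters_for_target) are hypothetical, and it assumes that the resolution of the standard clock() function is good enough once you time a large number of iterations in one go.

```c
#include <time.h>

/* Hypothetical busy-wait helper: burns `iters` loop iterations.
   The volatile counter prevents the loop from being optimized away. */
static void busy_delay(unsigned long iters)
{
    volatile unsigned long i;
    for (i = 0; i < iters; ++i) {
        /* intentionally empty */
    }
}

/* Estimate the CPU time of one loop iteration, in seconds, by timing
   `iters` iterations with clock(). Returns 0.0 if iters is 0 or if
   clock() is not available. */
double seconds_per_iteration(unsigned long iters)
{
    clock_t start, end;

    if (iters == 0) {
        return 0.0;
    }
    start = clock();
    busy_delay(iters);
    end = clock();
    if (start == (clock_t)-1 || end == (clock_t)-1) {
        return 0.0;
    }
    return ((double)(end - start) / CLOCKS_PER_SEC) / (double)iters;
}

/* Convert a target delay (seconds per call) into a loop count, rounded
   to the nearest iteration. Returns 0 if the measurement is unusable. */
unsigned long iters_for_target(double target_seconds, double per_iter_seconds)
{
    if (target_seconds <= 0.0 || per_iter_seconds <= 0.0) {
        return 0;
    }
    return (unsigned long)(target_seconds / per_iter_seconds + 0.5);
}
```

You would call seconds_per_iteration() once with a large count (tens of millions of iterations, say), then feed the result and the extra time the target device needs per floating point call into iters_for_target(). The target-side number still has to come from the device's documentation or from a measurement on the device, so this only takes care of the PC half of the calibration.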