Question

I want to use Vtune Profiler APIs to profile a code running on Xeon Phi (Linux, using offload execution) to see the number of instructions executed, the number of L1 cache misses, etc. But I can't find anywhere explaining how to use this library.

Where to find the library files and include files in Linux? How do I write a code to profile a short code running on Xeon Phi?

I would expect something like this:

//this code will be executed on host processor
Read_counters();
Code_to_run on Xeon Phi
Stop_counters();
Print results();

Thanks

Was it helpful?

Solution

There is no such kind of API support in VTune. Use Knights Corner Platform Analysis in VTune GUI or command line - it runs your program on host, but collects hardware counters only from the Xeon Phi card. I.e. as result you should see performance metrics for only offload code.

You may also find this article useful for interpreting results: http://software.intel.com/en-us/ARTICLES/OPTIMIZATION-AND-PERFORMANCE-TUNING-FOR-INTEL-XEON-PHI-COPROCESSORS-PART-2-UNDERSTANDING

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top