Question

For instance, cublas<t>geam() computes:

C = αop(A) + βop(B)

But what if I want to store the result in A itself? Can I call it with the output pointer C equal to A, so that:

A = αop(A) + βop(B)

without fear of writing output to a matrix that is still being read as an input?

If so, is it guaranteed that the same works safely for all other CUBLAS matrix operations?

Solution

The answer is already in the cuBLAS documentation for the function you posted, cublas<t>geam(): if C overlaps A or B, the behaviour is undefined. NVIDIA does not guarantee that the call will work when C == A.

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow