Can input matrices also be used to store the output matrix with CUBLAS?

https://stackoverflow.com/questions/16329997

c
cuda
gpgpu
cublas

13-04-2022
|

Frage

For instance, cublas<t>geam() will do:

C = αop(A) + βop(B)

But what if I want to store the result in A anyway? Can I call it with pointers *C = *A so that:

A = αop(A) + βop(B)

without fear that I may be writing output to a matrix still being read as an input??

If so, are guaranteed that we do this with all other CUBLAS matrix operations safely?

Lösung

The answer is alright in the cublas documentation, as you posted it - cublasgeam() - If C overlaps A or B, then behaviour is undefined. Nvidia won't guarantee that this will work, if C == A.

Lizenziert unter: CC-BY-SA mit Zuschreibung

Nicht verbunden mit StackOverflow