Assuming vectors of 16 x 1 byte elements, you would set up a mask vector where each element is either all 0s (0x00
) or all 1s (0xff
) to determine whether the element should be stored on not. Then you need to do the following (pseudo code):
init mask vector = 0x00/0xff in each element
init source vector = data to be selectively stored
load dest vector from dest location
apply `vbslq_u8` (`vbit` instruction) with dest vector, source vector and mask vector
store dest vector back to dest location