As mentioned in the Previous Answer use CL_INTENSITY and CL_LUMINANCE.
It is expected that the latency of addressing calculations is hidden better for Image objects, but texture cache is not kept coherent with respect to image writes, so any image read to an address that has been written to via an image write in the same kernel call returns undefined data.So developers prefer to use regular buffers only.