In all likelihood, you're exceeding the size of the device heap, which is where in-kernel malloc and new get their memory and which defaults to only 8 MB. You can raise that limit with a CUDA runtime API call:
cudaDeviceSetLimit(cudaLimitMallocHeapSize, (size_t)n*100000*sizeof(float));
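Make sure you do this before launching any kernel that calls malloc or free on the device, otherwise the call will fail. That said, you should strongly consider calling cudaMalloc once from the host to allocate a single large array, and having each thread use its own slice of it, instead of allocating inside the kernel.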
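To make both suggestions concrete, here is a minimal sketch. The kernel name `fill`, the `CUDA_CHECK` macro, and the sizes `n` and `perThread` are placeholders I made up for illustration, not anything from your code, so substitute your own values. It first shows the heap-limit call with error checking, then the single-cudaMalloc alternative where each thread indexes into its own slice.

```cuda
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>

// Hypothetical helper: abort with a readable message if a CUDA call fails.
#define CUDA_CHECK(call)                                              \
    do {                                                              \
        cudaError_t err_ = (call);                                    \
        if (err_ != cudaSuccess) {                                    \
            std::fprintf(stderr, "%s:%d: %s\n", __FILE__, __LINE__,   \
                         cudaGetErrorString(err_));                   \
            std::exit(EXIT_FAILURE);                                  \
        }                                                             \
    } while (0)

// Hypothetical kernel: each thread writes to its own slice of one big
// buffer instead of calling malloc() inside the kernel.
__global__ void fill(float *buf, size_t perThread, int n)
{
    int tid = blockIdx.x * blockDim.x + threadIdx.x;
    if (tid >= n) return;                        // guard the last block
    float *mine = buf + (size_t)tid * perThread; // this thread's slice
    for (size_t i = 0; i < perThread; ++i)
        mine[i] = (float)tid;
}

int main()
{
    const int    n         = 256;   // assumed thread count
    const size_t perThread = 1000;  // assumed floats per thread
    const size_t bytes     = (size_t)n * perThread * sizeof(float);

    // Approach 1: resize the device heap. This has to happen before any
    // kernel that uses device-side malloc/free is launched. Only needed
    // if you keep allocating inside the kernel.
    CUDA_CHECK(cudaDeviceSetLimit(cudaLimitMallocHeapSize, bytes));
    size_t heap = 0;
    CUDA_CHECK(cudaDeviceGetLimit(&heap, cudaLimitMallocHeapSize));
    std::printf("device heap limit: %zu bytes\n", heap);

    // Approach 2 (preferred): one host-side allocation, sliced per thread.
    float *buf = nullptr;
    CUDA_CHECK(cudaMalloc(&buf, bytes));

    const int threads = 128;
    const int blocks  = (n + threads - 1) / threads;
    fill<<<blocks, threads>>>(buf, perThread, n);
    CUDA_CHECK(cudaGetLastError());      // catches launch errors
    CUDA_CHECK(cudaDeviceSynchronize()); // catches errors during execution

    CUDA_CHECK(cudaFree(buf));
    return 0;
}
```

With the second approach the heap limit no longer matters at all, and a failed allocation shows up as an error code on the host rather than as a null pointer inside the kernel.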