Neural network with single positive non limited output value

Question 1

Here are a few suggestions:

Make sure you are using a mean squared error on the output, no activation function needed: err(x,f(x))= (x-f(x))^2, where x is the true output and f(x) is the predicted output.
If you implemented back propagation yourself, track the progress of the average error of the network for each epoch. If the error increases between epochs, then this indicates that your implementation is incorrect. You could also check the gradients computed by backpropagation by comparing with approximate gradients computed using finite differences.
Normalize your input and output: subtract the means and divide by the standard deviation or whiten your data using PCA

Resources:

Question 2

You are asking a very vague question on a very broad topic but relating to your very concrete and individual problem. You'll probably not get a helpful answer this way but I'll give it a try anyway.

By your description it seems like you are new to the field of neural networks and would like to just throw it on your classification problem. As you might infer from the length and depth of the wiki page on neural networks there are lots of dificulties involved in making neural networks "just work". I recommend studying the topic a bit before expecting insightful results. You'll have a much better idea where to look for what's wrong with your approach.

Here is my list of things you should pay attention to when using neural networks for classification:

Data sets for training and validation
Input encoding and normalization
Neuron types (activation function)
Network topology
Output encoding and "denormalization"
"Fitness" evaluation function
Training algorithm

My best guess is that you got a good amount of those right already but your topology is insufficient. Topology is what makes or breaks the ability to learn stuff! You might want to add more neurons in the hidden layer. Also you might want to use multiple output neurons instead of one. In your case you might have real estate value categories ($0-10k, $10k-50k, $50k-100k, etc.) so you could use one output neuron per category. Those categories might be easier to learn than a precise analog estimate.

My second best guess is that there simply is no pattern in your input data. If you were presented one input vector, would you be able to estimate the correct real estate value? How do you do it?

If none of my guesses are helpful, it seems like something with your training process is not working correctly. Investigate by applying your implementation to popular and well understood training problems on the internet to gain more insight.