Question

I am trying to run a local Ruby script that uses Mechanize to log into a website, visit about 1500 of its pages, and parse information from each of them. The parsing works, but only for a while: the script runs for about 45 seconds and then stops completely and reports:

/Users/myname/.rvm/gems/ruby-1.9.3-p374/gems/mechanize-2.7.1/lib/mechanize/http/agent.rb:306:in `fetch': 503 => Net::HTTPServiceUnavailable for http://example.com/page;53 -- unhandled response (Mechanize::ResponseCodeError)

I can't tell for sure, but my guess is that this is due to a connection timeout. I tried to address that by setting very long timeouts in the script (it can take up to 15 minutes to run), but that didn't change anything. Let me know if you have any ideas.

This is my script:

require 'mechanize'
require 'open-uri'
require 'rubygems'

agent = Mechanize.new 
agent.open_timeout   = 1000
agent.read_timeout   = 1000
agent.max_history = 1

page = agent.get('http://examplesite.com') # Mechanize needs a full, absolute URL here

myform = page.form_with(:action => '/maint')

myuserid_field = myform.field_with(:id => "username")
myuserid_field.value = 'myusername'  
mypass_field = myform.field_with(:id => "password")
mypass_field.value = 'mypassword' 

page = agent.submit(myform, myform.buttons.first)

urlArray = [giant array of webpages here]

urlArray.each do |term|
    page = agent.get(term)
    page.encoding = 'windows-1252'
    puts agent.page.parser.xpath("//tr[4]/td[2]/textarea/text()").text + 'NEWLINEHERE'
end

Solution

Try calling sleep(1) inside your each loop. A 503 means the server itself is refusing to respond, and it is very likely being overwhelmed (or rate-limiting you) because you are firing off requests without any pause.
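For example, your loop with a pause added (same variables as in your script; one second is just a starting point):

urlArray.each do |term|
    page = agent.get(term)
    page.encoding = 'windows-1252'
    puts page.parser.xpath("//tr[4]/td[2]/textarea/text()").text + 'NEWLINEHERE'
    sleep(1) # pause between requests so the server is not flooded
end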

OTHER TIPS

My first suspicion is that you are violating the site's terms of service (TOS) and/or their robots.txt file, and their system is temporarily banning you.

Running a spider or crawler at full speed isn't being a good network citizen, so find their TOS and learn how to load and parse a robots.txt file so you can play by their rules. Mechanize knows how to honor robots.txt, but you have to enable it with robots=.
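A small sketch of what that looks like (once robots is enabled, Mechanize raises Mechanize::RobotsDisallowedError for URLs that robots.txt forbids, so rescue that if you want to skip them rather than abort):

agent = Mechanize.new
agent.robots = true # honor the site's robots.txt

begin
  page = agent.get(term)
rescue Mechanize::RobotsDisallowedError
  # robots.txt forbids this URL, so skip it
end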

Trying to read 1500 pages in one go, without an agreement that it's OK, looks like an obvious sack-and-pillage run, so don't hit them so hard. Remember, it's their bandwidth and CPU you're consuming too. Keep hammering them and they might ban you permanently, which is not what you want.
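If you still hit the occasional 503 even at a slower pace, you can rescue it and back off before retrying. A rough sketch wrapping the agent.get call inside your loop (response_code is the HTTP status as a string; the 30-second pause and the unlimited retry are arbitrary choices, so you may want to cap them):

begin
  page = agent.get(term)
rescue Mechanize::ResponseCodeError => e
  raise unless e.response_code == '503'
  sleep(30) # the server says it is unavailable, so back off
  retry
end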

It could also be that the server is responding slowly or not at all, in which case catching the error lets the script carry on with the remaining pages. I had a similar sort of problem before and solved it with a timeout. You might implement it like this:

require 'timeout'

begin
  Timeout.timeout(5) do
    # interrupts if the request takes more than 5 seconds
    page = agent.get(term)
  end
rescue Timeout::Error
  # note which URL timed out and carry on where it left off
  next
end

If you need to persist what you have scraped between runs (and you are in a Rails environment), you might use Rails.cache.write and Rails.cache.read to store and read the data.
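A minimal sketch, assuming you are inside a Rails app with a cache store configured (the cache key is made up for illustration):

Rails.cache.write("scraped/#{term}", page.parser.xpath("//tr[4]/td[2]/textarea/text()").text)
text = Rails.cache.read("scraped/#{term}")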

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow