Rescheduled jobs will only get run in queues that have their rerun
attribute (FALSE by default) set to TRUE, so check your queue configuration (qconf -mq myqueue
). Without this, your job remains in the rescheduled-pending state indefinitely because it has nowhere to go.
IIRC, submitting jobs with qsub -r yes
only qualifies them for automatic rescheduling in the event of an exec node crash, and that exiting with status 99 should trigger a reschedule regardless.