Formatting if try except else code blocks

Question 1

How about with a decorator:

import os
import pickle
import functools

PICKLE = False
PICKLE_PATH = '/tmp'

def checkpoint(f):

    if not PICKLE:
        return f

    save_path = os.path.join(PICKLE_PATH, '%s.pickle' % f.__name__)

    @functools.wraps(f)
    def wrapper(*args, **kwargs):
        if os.path.exists(save_path):
            with open(save_path, 'rb') as f:
                return pickle.load(f)

        rv = f(*args, **kwargs)
        with open(save_path, 'wb') as f:
            pickle.dump(rv, f)

        return rv

    return wrapper

Usage:

@checkpoint
def step1():
    return do_stuff_here()


def intermediate_step():
    return some_operation(step1())

@checkpoint
def step2():
    return do_stuff_with(intermediate_step())

# ... and so on

Question 2

Why not extract it further into a function to avoid all the repeating code?

def pickle_function(pickle_filename, data_function):
    with open(pickle_filename, 'wb') as f:
        try:
            data = pickle.load(f)
        except:
            data = data_function()
            pickle.dump(data, f)

if PICKLE:
    pickle_function('pickle1.pkl', generateData1)

# Some intermediate logic before next 'checkpoint'

if PICKLE:
    pickle_function('pickle2.pkl', generateData2)

Also, I'm not sure what Exception you're catching when opening files so you may have to reorganise if the file may not exist. It's always a good idea to catch specific Exceptions (e.g. except FileNotFoundError:) so that any unexpected behaviour is raised loudly.

Question 3

You might also get away from code repetition with a while syntax instead of repeated if/elses.

So, as a really basic example that doesn't necessarily intend to inform you on your workflow, you have a function that handles what to do with the data in question.

def change_data(previousdata, iteration):
    if iteration == 0:
        ##some change
        return new_value
    elif iteration == 1:
        ##some other change
        return new_value
    …
    elif iteration = total_needed ##however many different tests there are
        indicate_doneness() ##whatever this means for you

And you have those suggested 'load from pickle, OR create data and dump it' function.

def pickle_or_dont(args):
    try: ##the suggested code from other answers

Then set up a while loop to track how many iterations have been done and which 'stage' you're at. This eliminates your need to repeat code.

total_needed = 7 ##or however many 
data_generated = 0
while data_generated < total_needed:
    my_data = change_data(my_data, data_generated)
    pickle_or_dont(my_data)
    data_generated += 1

My sense of your intended order of operations may not be correct, you will know better than I. BUT I do think a while loop will keep you from repeating code.