Question

Is there a way to estimate the remaining time when fitting a model? For example

model = sk.ensemble.RandomForestRegressor(n_estimators=10)
model.fit(x, y)

I have a quite large dataset (millions of rows), this is going to take some time so I would like to know estimated time so I can do other thigngs and get back when the process is finished.

With ensembles like random forest estimation of remaining time should be [reasonably] easy.

Was it helpful?

Solution

Try verbose option. You can change it from 0 (no output), 1 (update for each job), and 2 (update for each tree), e.g.

model = RandomForestRegressor(n_estimators=100, verbose=2, n_jobs=2).fit(X_train, y_train)
Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top