Is there a way to estimate the remaining time when fitting a model? For example

model = sk.ensemble.RandomForestRegressor(n_estimators=10)
model.fit(x, y)

I have a quite large dataset (millions of rows), this is going to take some time so I would like to know estimated time so I can do other thigngs and get back when the process is finished.

With ensembles like random forest estimation of remaining time should be [reasonably] easy.

有帮助吗?

解决方案

Try verbose option. You can change it from 0 (no output), 1 (update for each job), and 2 (update for each tree), e.g.

model = RandomForestRegressor(n_estimators=100, verbose=2, n_jobs=2).fit(X_train, y_train)
许可以下: CC-BY-SA归因
不隶属于 StackOverflow
scroll top