fit uses partial_fit internally, so the learning rate configuration parameters apply to both fit and partial_fit. The default annealing schedule is eta0 / sqrt(t) with eta0 = 0.01.
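Edit: this is not correct. As noted in the comments, the actual default schedule for SGDClassifier is:
1.0 / (t + t0)
where t0 is set heuristically and t is the number of samples seen so far. (More recent scikit-learn documentation writes this 'optimal' schedule as 1.0 / (alpha * (t + t0)), where alpha is the regularization strength.)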
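To get a feel for how the two schedules mentioned above decay, here is a minimal pure-Python sketch. It does not call scikit-learn. The function names and the value of T0 are made up for illustration, since the real t0 is picked heuristically by the library, and the second function uses the 1.0 / (t + t0) form exactly as quoted in the edit.

```python
import math


def inv_sqrt_schedule(t, eta0=0.01):
    """eta0 / sqrt(t), the schedule first quoted in the answer."""
    return eta0 / math.sqrt(t)


def one_over_t_schedule(t, t0):
    """1.0 / (t + t0), the schedule given in the edit.

    t0 is passed in by hand here; scikit-learn sets it heuristically.
    """
    return 1.0 / (t + t0)


T0 = 100.0  # hypothetical value, chosen only for this illustration
steps = [1, 10, 100, 1000]  # t = number of samples seen so far

inv_sqrt_rates = [inv_sqrt_schedule(t) for t in steps]
one_over_t_rates = [one_over_t_schedule(t, T0) for t in steps]

for t, a, b in zip(steps, inv_sqrt_rates, one_over_t_rates):
    print(f"t={t:>5}  eta0/sqrt(t)={a:.6f}  1/(t+t0)={b:.6f}")
```

Both rates shrink as more samples are seen, but 1 / (t + t0) falls off faster for large t than eta0 / sqrt(t). The absolute numbers depend entirely on the assumed T0, so treat them as illustrative only.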