Question

Backpropagation calculates dW (weight delta) per weight per pattern, so it's straightforward how to modify weights when doing stochastic training. How do I use it for batch training, though? Simply accumluate dW over the entire training set and then apply the modfication, or is there more to it?

Was it helpful?

Solution

Yes, just accumluate dW over the entire training set. At least that is how I coded it back in grad school...

OTHER TIPS

You can do a lot with the different gradients from the different samples. That includes higher order information (approximate 2nd derivative) or conjugate gradient or natural gradient or ... :)

Licensed under: CC-BY-SA with attribution
Not affiliated with StackOverflow
scroll top