Question

I have been reading through the code examples for Stanford's Deep Learning course, and I see that they compute num_steps = (params.train_size + params.batch_size - 1) // params.batch_size [github link].

Why isn't it num_steps = params.train_size // params.batch_size instead?
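To make the difference concrete, here is a minimal sketch comparing the two formulas. The values train_size = 1050 and batch_size = 100 and the helper function names are made up for illustration and are not taken from the course code. The first formula appears to be integer ceiling division, so it would count a final, smaller batch when train_size is not a multiple of batch_size, while plain floor division would leave those leftover examples out.

```python
import math


def num_steps_ceil(train_size: int, batch_size: int) -> int:
    # Formula from the question: integer ceiling division.
    return (train_size + batch_size - 1) // batch_size


def num_steps_floor(train_size: int, batch_size: int) -> int:
    # Alternative from the question: floor division.
    return train_size // batch_size


# Hypothetical sizes, chosen so that train_size is not a multiple of batch_size.
train_size, batch_size = 1050, 100

print(num_steps_ceil(train_size, batch_size))   # 11: ten full batches plus one batch of 50
print(num_steps_floor(train_size, batch_size))  # 10: the last 50 examples are not counted

# The first formula agrees with math.ceil on the true quotient.
print(num_steps_ceil(train_size, batch_size) == math.ceil(train_size / batch_size))  # True

# When train_size divides evenly, both formulas give the same result.
print(num_steps_ceil(1000, 100), num_steps_floor(1000, 100))  # 10 10
```

With these assumed numbers, the two formulas differ by exactly one step, and only when there is a remainder.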

No correct solution

Licensed under: CC-BY-SA with attribution