No, We Don't Have to Choose Batch Sizes As Powers Of 2



When it comes to neural network training, I think we are all guilty of doing this: we choose our batch sizes as powers of 2, that is, 64, 128, 256, 512, 1024, and so forth. There are some valid theoretical justifications for this, but how does it pan out in practice? We had some discussions about that over the last couple of days, and here I want to write down some of the takeaways so I can reference them in the future. I hope you'll find this helpful as well!

View more on Sebastian Raschka's website »


Published on July 05, 2022 00:00


Sebastian Raschka's Blog
