I'm curious is the batch_size equal to 20 enough for training? should I try batch_size=128 or 256?
I'm curious is the batch_size equal to 20 enough for training? should I try batch_size=128 or 256?