I am using a generator in my LSTM keras model. In the generator I would like to shuffle samples using shuffle = True. My question is that when I shuffle data, what should be the number of steps_per_epoch so that during one epoch the model could be trained based on the entire train set. Since samples in batches are shuffled, using the traditional number total_num_samples/batch_size would not allow us to make sure that we use the entire training data over one epoch. Thanks for your thoughts in advance.
generator
shuffle = True
steps_per_epoch
epoch
total_num_samples/batch_size
1.4m articles
1.4m replys
5 comments
57.0k users