How to train a keras model on both original and augmented data from ImageDataGenerator?

I have a dataset that contains about 87000 images in a directory, with each class in a separate subfolder. I've tried the class ImageDataGenerator() and the function flow_from_directory() for generating the images, it worked completely fine but I have a question.. Does flow_from_directory() only yield the augmented images? and if this is the case, how can I train my model which has overfit the training set on both original and augmented data? Thanks

Topic data-augmentation overfitting keras

Category Data Science


Below are the things can be done in order to reduce variance (overfitting):

  1. Add more training data.
  2. Normalization (BatchNorm, LayerNorm)
  3. Data Augmentation
  4. Regularization (Dropout, L2, WeightDecay)
  5. Error Analysis
  6. Tune Hyperparameters
  7. Early Stopping
  8. Use better state of the art model or transfer learning

ImageDataGenerator do augmentation on the fly base on the setting you give , so it did not separate “original” or “augmented “ data , just the possibility of data been augmented. so your question actually is “ how to tune model when overfitting happens ”? I think you can start from adding dropout or reduce number of parameters. Hope this helps, cheers.

About

Geeks Mental is a community that publishes articles and tutorials about Web, Android, Data Science, new techniques and Linux security.