Question About Discriminator of CycleGan

The Discriminator of CycleGan outputs not just a single value to say that the image is real or fake.... But It outputs a grid of numbers (like 8X8 or 7x7), where each number says whether one patch of the input image is fake or real.

So, my question is that, why do we do this.. What benefits does it gives us, and what was the problem in the approach where we only outputted a single value?

Topic cyclegan gan deep-learning neural-network machine-learning

Category Data Science


This is explained in the original CycleGAN paper:

For the discriminator networks we use 70 × 70 PatchGANs, which aim to classify whether 70 × 70 overlapping image patches are real or fake. Such a patch-level discriminator architecture has fewer parameters than a full-image discriminator and can work on arbitrarily sized images in a fully convolutional fashion.

Therefore, the arguments for using patch-based discriminators are that:

  • It has fewer parameters.
  • It enables using them in arbitrarily sized images.

About

Geeks Mental is a community that publishes articles and tutorials about Web, Android, Data Science, new techniques and Linux security.