When using a model like VGG16 as a classifier within Faster RCNN, does Faster RCNN then use 2 CNNs in total?
Im currently doing a project about CNN's but im quite confused because they can be used to classify and to extract features. According to the Faster RCNN paper, it uses a ResNet backbone. I have also seen that you can use for example VGG16 with Faster RCNN to classify,lets say types of vegetables. Does it mean that when I implement it this way, it uses 2 cnn's in total, namely resnet for extracting features of ROI's and then VGG for classifying what these regions actually are?
I tried to illustrate what i mean in the image below, is this how it works or am i completely missing the point?
Topic inceptionresnetv2 faster-rcnn vgg16 cnn
Category Data Science