extract features from parts of one image

I have several parts of one image that have one caption... I need to do image captioning by evaluating every part of the image to which the caption will belong so do I need to extract the features from parts of the image and pass it to the model with its caption ? or how can I do it please?

for example; the dataset I have are the parts of the image which are divided into three parts “beach, sea, dog”, and the caption I have for the three parts is one caption which is a dog at the beach . is there any way to extract features for those parts as one image? does concatenate features will be the same feature as the original image

I need to evaluate every part of the image to which the caption will belong

Topic pytorch keras image-classification feature-extraction

Category Data Science

About

Geeks Mental is a community that publishes articles and tutorials about Web, Android, Data Science, new techniques and Linux security.