Approach fpr extracting/cropping features images using deeplearning and no annotations
Let's say I want to have a bunch of images of hats from videos. How would I priniciple build something that would learn to recognize, and crop or bound box hats? I heard you need a dataset with bounding boxes manually drawn for training, but it seems there would be a way for a nn to identify that on their own?
I'm trying to understand the possibility of scraping video for different items. I.e. give it images of 1000 hats, and then it will crop out images of hats from a bunch of video files.
I am thinking this could be an interesting thing to work on, but would need some advicein terms of how to arrpoach it.
Also, the next logical thing is then to put hats on people in movies somehow, but that would be phase 2.
Thanks
Topic tensorflow convolution computer-vision deep-learning machine-learning
Category Data Science