Extract features using a 3D bounding box
I have a ground-truth bounding box for a 3D object, and I would like to extract useful features for that object. My goal is to concatenate these visual object features with language features (from a description of the object) for training.
For the visual features, I want to use the ground-truth bounding boxes (rather than predicted ones) to establish an upper-bound baseline.
How do I extract features using a 3D bounding box?
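
To make the setup concrete, here is a minimal sketch of the kind of pipeline I have in mind. It rests on assumptions not stated above: the scene is a point cloud stored as an (N, 3) NumPy array, and the box is axis-aligned and given as a center and a size. The function names `crop_points_in_box` and `box_features` are hypothetical, and the hand-crafted statistics only stand in for whatever learned encoder would actually process the cropped points.

```python
import numpy as np

def crop_points_in_box(points, center, size):
    """Return the points that fall inside an axis-aligned 3D box.

    Assumption: the box is axis-aligned; an oriented box would first need
    the points rotated into the box frame.
    """
    points = np.asarray(points, dtype=np.float64)
    center = np.asarray(center, dtype=np.float64)
    half = np.asarray(size, dtype=np.float64) / 2.0
    # A point is inside if it lies within half the box size on every axis.
    inside = np.all(np.abs(points[:, :3] - center) <= half, axis=1)
    return points[inside]

def box_features(points, center, size):
    """Placeholder visual feature: box geometry plus simple point statistics."""
    cropped = crop_points_in_box(points, center, size)
    center = np.asarray(center, dtype=np.float64)
    size = np.asarray(size, dtype=np.float64)
    if len(cropped) == 0:
        # Empty box: fall back to zeros so the feature length stays fixed.
        mean_offset = np.zeros(3)
        std = np.zeros(3)
    else:
        xyz = cropped[:, :3]
        mean_offset = xyz.mean(axis=0) - center
        std = xyz.std(axis=0)
    count = np.array([float(len(cropped))])
    # 3 (center) + 3 (size) + 3 (mean offset) + 3 (std) + 1 (count) = 13 values.
    return np.concatenate([center, size, mean_offset, std, count])
```

The resulting vector could then be joined with a language embedding of the description, for example `np.concatenate([box_features(points, center, size), language_features])`, where `language_features` is a hypothetical 1-D array from a text encoder.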