It is an imaginary rectangular box which is an object detection and localisation and computer vision. Bounding box annotations contain coordinates which bear information about an object's location in the image or video. It is most suitable for uniformly shaped objects and those which do not overlap.
Intersection Over Union is a technique in computer vision used to identify as well as locate objects in digital images or videos. Bounding box annotations are used to achieve this purpose through localisation.
When there is excessive overlap between the predicting box and the bounding box, the model is unable to identify the targeted object within a dataset. The less the overlap, the more accurate the bounding box becomes.
In bounding box annotation, it is not advised to start with diagonally shaped items. This is because they naturally take up smaller spaces within the bounding box. The machine model may mistake the actual background as the targeted object, defeating the original purpose of the annotation.
It is advisable to focus on smaller objects because it hardly creates room for errors. Larger images are less accurate when used in bounding box datasets.
Every object of interest needs to be labelled as training models need to identify them all for accurate performance. It will be a catastrophic error to label an image and leave another. This is because models have been built in a way to know which pixel patterns are compliant with an object of interest.
Box size variations must be consistent. This is owing to the fact that large objects usually underperform in datasets, especially when the same type of object looks smaller. Training models with same size objects will also not allow it to function to perfection.
To ensure that pixels are perfectly tight the bounding box edges must touch the most out of pixels of the labelled object. Where gaps are left, a model may be unable to give accurate predictions.
Occluded objects are those which are not in full view because there is an obstruction blocking its visibility. That does not warrant it being left partially labelled or completely unlabelled. It should be annotated just like the fully visible images in the bounding box.
An object of interest should be thoroughly and fully labelled as partial labelling leads to confusion on the part of the model what the full object truly consists of.
It is best to be very specific in annotation rather than being general in outlook. This is because in the event of an error, a very specific label can be adjusted or re-annotated. However, in the event that labelling was general, a need will arise to label the whole dataset again to gain the needed accuracy.
We have a wide range of solutions and tools that will help you train your algorithms. Click below to learn more!