The accuracy of object detectors and trackers is most commonly evaluated by the Intersection over Union (IoU) of the tracker prediction and the ground truth. In all common tracking benchmarks, the ground truth is restricted to axis-aligned or oriented boxes. To evaluate tracker accuracy more precisely, we present a toolkit that works with ground truth segmentations. To show how well approaches restricted to boxes can perform at best, we present upper bounds for all box-based trackers on the Visual Object Tracking (VOT) and Densely Annotated Video Segmentation (DAVIS) challenges. The toolkit is easy to use, and arbitrary trackers written in Python, Matlab, or HALCON can be added.
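As an illustration of the idea, the following minimal sketch computes the IoU between a ground truth segmentation and an axis-aligned box, and then searches exhaustively for the box with the highest IoU, which gives an upper bound for any axis-aligned box prediction on that frame. It is not the toolkit's implementation: the function names (`mask_box_iou`, `tight_box`, `best_box_iou`), the half-open `(r0, c0, r1, c1)` box convention, and the brute-force search are assumptions made for this example, and the search is only practical for small masks.

```python
import numpy as np


def mask_box_iou(mask, box):
    """IoU of a binary mask and an axis-aligned box (r0, c0, r1, c1), half-open."""
    mask = np.asarray(mask, dtype=bool)
    r0, c0, r1, c1 = box
    box_area = (r1 - r0) * (c1 - c0)
    inter = int(mask[r0:r1, c0:c1].sum())
    union = int(mask.sum()) + box_area - inter
    return inter / union if union > 0 else 0.0


def tight_box(mask):
    """Smallest axis-aligned box enclosing all mask pixels (mask must be non-empty)."""
    mask = np.asarray(mask, dtype=bool)
    rows = np.flatnonzero(mask.any(axis=1))
    cols = np.flatnonzero(mask.any(axis=0))
    return int(rows[0]), int(cols[0]), int(rows[-1]) + 1, int(cols[-1]) + 1


def best_box_iou(mask):
    """Exhaustive search for the axis-aligned box with maximal IoU.

    Only boxes inside the tight bounding box are considered, because growing
    a box beyond it adds background pixels without adding intersection.
    """
    mask = np.asarray(mask, dtype=bool)
    area = int(mask.sum())
    if area == 0:
        return 0.0, None
    r_lo, c_lo, r_hi, c_hi = tight_box(mask)
    # Integral image with a zero border, so any box sum takes four lookups.
    ii = np.pad(mask.astype(np.int64).cumsum(0).cumsum(1), ((1, 0), (1, 0)))
    best_iou, best_box = 0.0, None
    for r0 in range(r_lo, r_hi):
        for r1 in range(r0 + 1, r_hi + 1):
            for c0 in range(c_lo, c_hi):
                for c1 in range(c0 + 1, c_hi + 1):
                    inter = int(ii[r1, c1] - ii[r0, c1] - ii[r1, c0] + ii[r0, c0])
                    union = area + (r1 - r0) * (c1 - c0) - inter
                    iou = inter / union
                    if iou > best_iou:
                        best_iou, best_box = iou, (r0, c0, r1, c1)
    return best_iou, best_box


# Hypothetical L-shaped object: a tall bar with a small foot at the bottom.
mask = np.zeros((8, 8), dtype=bool)
mask[0:8, 0:2] = True
mask[6:8, 2:4] = True

tight_iou = mask_box_iou(mask, tight_box(mask))
upper_bound, upper_bound_box = best_box_iou(mask)
print(f"tight box IoU: {tight_iou:.3f}")
print(f"best box IoU:  {upper_bound:.3f} at {upper_bound_box}")
```

For non-rectangular objects such as this one, even the best possible box stays below an IoU of 1, and the tight bounding box is not necessarily the box that maximizes IoU.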

