https://github.com/NanoNets/object-detection-sample-python/blob/03e632dd76d148ced9f61503371d4554c133d2ff/code/upload-training.py#L13

The VOC format stores annotations as XML, and many open-source tools and datasets provide their annotations in this XML format.
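
For reference, here is a minimal sketch of reading a VOC-style XML annotation with Python's standard library. The sample XML, the file name `example.jpg`, the label `dog`, and the helper `parse_voc_annotation` are illustrative assumptions and are not taken from the linked script. The element names (`object`, `name`, `bndbox`, `xmin`, `ymin`, `xmax`, `ymax`) follow the usual Pascal VOC layout.

```python
import xml.etree.ElementTree as ET

# Illustrative VOC-style annotation (made up for this example).
SAMPLE_VOC_XML = """<annotation>
    <filename>example.jpg</filename>
    <size>
        <width>640</width>
        <height>480</height>
        <depth>3</depth>
    </size>
    <object>
        <name>dog</name>
        <bndbox>
            <xmin>48</xmin>
            <ymin>240</ymin>
            <xmax>195</xmax>
            <ymax>371</ymax>
        </bndbox>
    </object>
</annotation>"""


def parse_voc_annotation(xml_text):
    """Hypothetical helper: extract the file name and bounding boxes."""
    root = ET.fromstring(xml_text)
    objects = []
    for obj in root.findall("object"):
        box = obj.find("bndbox")
        objects.append({
            "name": obj.findtext("name"),
            # Some tools write coordinates as floats, so go through float().
            "xmin": int(float(box.findtext("xmin"))),
            "ymin": int(float(box.findtext("ymin"))),
            "xmax": int(float(box.findtext("xmax"))),
            "ymax": int(float(box.findtext("ymax"))),
        })
    return {"filename": root.findtext("filename"), "objects": objects}


annotation = parse_voc_annotation(SAMPLE_VOC_XML)
print(annotation["filename"], annotation["objects"])
```

To read an annotation from disk instead, `ET.parse(path).getroot()` can replace `ET.fromstring(xml_text)`.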