An axis-aligned coarse representation of the detected text's location on the image.
Within the bounding box, a fine-grained polygon around the detected text.
An axis-aligned coarse representation of the detected text's location on the image.