From TFA, PaliGemma is competitive to GPT-4o and even beats it in terms of speed and OCR accuracy. It also can do object detection (bounding boxes) and segmentation which GPT-4V/o and Claude 3 Opus can't do at all.
Not to mention it's built to be fine tuned and commercially permissive!
Not to mention it's built to be fine tuned and commercially permissive!