I did some side-by-side comparisons of simple tasks (e.g. "Write a WCAG-compliant alternative text describing this image") with Bard vs GPT-4V.
Bard's output was significantly worse. I did my testing with some internal images so I can't share, but will try to compile some side-by-side from public images.
> Important: For now, Bard with our specifically tuned version of Gemini Pro works for text-based prompts, with support for other content types coming soon.
Huh! It has an image upload, and gives somewhat responsive, just not great, responses, so I'm a bit confused by that. So this is the existing Lens implementation?
Bard's output was significantly worse. I did my testing with some internal images so I can't share, but will try to compile some side-by-side from public images.