I did some side-by-side comparisons of simple tasks (e.g. "Write a WCAG-complian...

IanCal · on Dec 6, 2023

Bard with Gemini Pro is apparently text-only:

> Important: For now, Bard with our specifically tuned version of Gemini Pro works for text-based prompts, with support for other content types coming soon.

https://support.google.com/bard/answer/14294096

I'm in the UK and it's not available here yet. I really wish they'd be clearer about what I'm using; it's not the first time this has happened.

sinuhe69 · on Dec 6, 2023

You can ask Bard directly! Unlike ChatGPT, Bard can answer many things about itself.

IanCal · on Dec 6, 2023

It lies:

https://imgur.com/a/glPmXp3

I ask it if it's available in the UK and it says no. I say I'm in the UK, and it tells me it's not Gemini then.

aaronharnly · on Dec 6, 2023

Huh! It has an image upload and gives somewhat responsive, though not great, answers, so I'm a bit confused by that. So this is the existing Lens implementation?

staticman2 · on Dec 6, 2023

Bard has been capable of handling images for months.

IanCal · on Dec 6, 2023

Is PaLM 2 multimodal?

a_wild_dandan · on Dec 6, 2023

As it should! Hopefully Gemini Ultra will be released in a month or two for comparison to GPT-4V.

xfalcox · on Dec 6, 2023

I'm researching using LLMs for alt-text suggestions for forum users. Can you share your findings so far?

Outside of GPT-4V, I had good first results with https://github.com/THUDM/CogVLM

IanCal · on Dec 6, 2023

As a heads up, Bard with Gemini Pro only works with text.