I just spent the past 5 hours comparing LLMs

navjack27 · 2024-05-18T12:56:15

Your number one that you bring up shouldn't be a universal. You're basing everything, it seems, upon LLMS as being knowledge retrieval systems. You are dismissing situations where someone might want more creative output without having to guide the LLM with a more strict prompt.

smarri · 2024-05-20T17:21:13

Thanks. Can you share a sample of the questions you used? Were you comparing the LLMs to answers you knew already, or comparing the new answers which were output by the LLMs

atleastoptimal · 2024-05-20T21:47:20

GPT-4T/o is the smartest

Opus has the best vocabulary/apparent creativity

Gemini is the most "neutral"

Venkatesh10 · 2024-05-18T12:48:47

What's the reasoning for the data? Show the reference, table, graph to backup.

karanveer · 2024-05-19T13:21:41

you also recently did a post on reddit, didnt u? you're associated with chatplayground.ai where u did 80k views and got 8 meaningful conversions in the end or so...is that u?

Jimmc414 · 2024-05-19T02:52:45

GPT 3.5 and haiku are great for prototyping

qup · 2024-05-19T10:13:58

3.5 great for summarizing