No, they're claiming the specific LLMs tested are bad at it. They published thei... | Hacker News

Hacker News new | past | comments | ask | show | jobs | submit

login

		nitwit005 1 day ago \| parent \| context \| favorite \| on: Salesforce study finds LLM agents flunk CRM and co... No, they're claiming the specific LLMs tested are bad at it. They published their code. If you have an agent you think will do better, run it with their setup.

CityOfThrowaway 1 day ago [–]

Situationally, the original post claims that LLM Agents cannot do the tasks well. But they only tested one agent and swapped out models.

The conclusion here is that the very specific Agent that Salesforce built cannot do these tasks.

Which frankly, is not a very interesting conclusion.

Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact