Sincere question - why doesn't RL-based fine-tuning on top of LLMs solve this or... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		blueyes 5 months ago \| parent \| context \| favorite \| on: Ask HN: Any insider takes on Yann LeCun's push aga... Sincere question - why doesn't RL-based fine-tuning on top of LLMs solve this or at least push accuracy above a minimum acceptable threshhold in many use cases? OAI has a team doing this for enterprise clients. Several startups rolling out of current YC batch are doing versions of this.

InkCanon 5 months ago [–]

If you mean the so called agentic AI, I don't think it's several. Iirc someone in the most recent demo day mentioned ~80%+ were AI

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact