Beating GPT-4 with a 13B model (lmsys.org)
40 points by EvgeniyZh on Nov 14, 2023 | 9 comments



I hate the clickbait title combined with the obvious result....

I have read papers like this which were good, and mostly consisted of fine-tuning, combined with a few tricks to achieve high performance in large but specific domains.

This was a waste of my life. I want the minutes of reading this and of writing this comment back.


It wasn't wasted, thank you for reading it so the rest of us don't.


I love this positive way of looking at it. I wish more people had this as the default - "I did a thing and it was bad/hard, here's my experience so you don't have to do the same."


I would stipulate that the value wasn't "beating GPT-4" but actually "look, just because you can beat GPT-4, doesn't mean you actually delivered value" - and that is pretty important to say


It's possible to say that in two sentences, without a clickbait title.


To me this reads like a cynical commentary on the state of benchmarking, not an attempt to sell a 13B GPT4-Killer


Training on a rephrased test set "is all you need" to ace the test set.


This guy calls himself a programmer but couldn’t write code to print 100 random lines from a dictionary?


Programming nowadays is slowly separating from the idea of writing code.

No-code tools have made it way easier to develop things.

Some people who barely know programming are able to create their own AI product using LangChain!



