Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I wonder when we’ll reach diminishing returns on new AI models. Haven’t tried Mythos or Fable yet but it seems like Anthropic is already priming itself for this by calling for a “slowdown” of AI development.
 help



I think it's less of a technology problem and more of a money and investment problem. They called for the "slowdown" right when they filed for IPO. This allows them to both triage the money bleed while under the microscope as well as make their competitors look unethical and dangerous if they don't heed the warning to slow down before we lose control of A.I.

At least in cybersecurity, we already have seen diminishing returns with newer models.

At this point the harness/applayer matters more, as different models perform better or worse on exploit classes depending on the prompt, tuning, and various other parameters.

Of course, by the time HN hyperfixates on a topic, it's already been executed on and HN is too late.


Yea, in many usecases the tooling space is increasingly sophisticated context management such as fine tuning domain specific mappings into the model so that it is able to work directly with a compressed form of some data without needing to decompress into the context.

In larger models, these fine tuning techniques work more reliably/robustly. Because of this many usecases tend to prefer larger models. It is possible to work the same behaviour into the smaller model, but it requires more effort. But it's one-time. And smaller models are usually much cheaper. People make a tradeoff along this curve.

This is observed at few-B scale upto hundred-B scale. No way for us non-anthropic/openai to fine tune beyond that of course.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: