Hacker News new | past | comments | ask | show | jobs | submit login

Microsoft's can search the open internet. That'd be a long disclosure list!



That's not what training data means


Isn't it considered one-shot learning training it within the context window? There were lots of glowing reports of how well it can do one/few shot learning that way.

I don't think that training on copywritten data is necessarily wrong, just pointing out that doing so within the context window rather than at weight training time might not be so different.


I’m not sure what is meant by this. There’s no “training” happening within the context window, at least not by the commonly-used definition of training, it’s all just part of the input. If you’re asking whether you can reverse-index search copywritten text and feed it into an AI model without permission, that’s been happening for years.


For example, within the context window giving it 5 examples of a problem in a class it has never seen, with answers, and then asking it to solve a sixth was given as one of the amazing few-shot learning examples.

It could potentially do similar searching by the internet for similar things to your question and then figuring out how to derive the answer, without finding an exact answer match directly.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: