I agree that for any given test, you could build a specific pipeline to optimize...

stephc_int13 · 2025-11-18T20:20:39 1763497239

The real strength of current neural nets/transformers relies on huge datasets.

ARC does not provide this kind of dataset, only a small public one and a private one that they use for the benchmarks.

Building your own large private ARC set does not seem too difficult if you have enough resources.

egeozcan · 2025-11-19T09:39:21 1763545161

How can they keep it private? It's not like they can run these models locally. Do the providers promise not to peek when they are testing?