
E-Reverance · 2026-01-12T19:12:00 1768245120

> Residual connections are more than a trick to help gradients flow. They’re a conservation law.

> Not a hack, not a trick. A principled constraint that makes the architecture work at scale.

jszymborski · 2026-01-12T21:54:57 1768254897

OK, I thought I was reading too much into it, but those same sentences also jumped out at me

roywiggins · 2026-01-12T23:25:09 1768260309

Pangram thinks the whole thing was LLM-generated, fwiw. As dodgy as AI detectors are, it is probably among the best. I don't doubt the author started with their own text, but I think it's been substantially revised via ChatGPT

DoctorOetker · 2026-01-12T21:41:24 1768254084

yes this reads like classic intellectual fellicitatio

E-Reverance · 2026-01-09T02:21:10 1767925270

> blowing up kids

Not to refute the difference in extent, but this is somewhat notable: https://en.wikipedia.org/wiki/Dahyan_airstrike

E-Reverance · 2026-01-09T00:10:51 1767917451

How was he equally brutal?

E-Reverance · 2025-12-29T02:44:13 1766976253

I haven't tried this myself and this might be absurd, but attending PhD defences might be an interesting way to meet new people

E-Reverance · 2025-12-23T04:24:53 1766463893

It should be noted that these are NOT the official scores on the private evaluation set

viraptor · 2025-12-23T12:03:39 1766491419

Here it matters much less than in generic LLMs though. There's no chance of test set leakage since the network is not general purpose / not trained on the internet.

E-Reverance · 2025-12-21T06:15:56 1766297756

I didn't know how to title this. I definitely don't believe his proof claims but I found this whole event to be psychologically interesting

bigyabai · 2025-12-21T06:26:59 1766298419

> Opus 4.5 likes it

And to think, people said peer review in academia is dead.

E-Reverance · 2025-12-15T17:17:58 1765819078

One can care about both

E-Reverance · 2025-12-05T20:28:10 1764966490

> I do actually believe that zero teenagers should make banking apps or run non-profits.

That sounds like a lot of fun and should be a pretty social experience.

Also I'm going to assume his parents are proud, which should put his family at ease.

E-Reverance · 2025-11-25T18:16:53 1764094613

Surprised there wasn't any mention of Equilibrium Matching [1] in the future work section

[1] https://raywang4.github.io/equilibrium_matching/

E-Reverance · 2025-11-02T21:08:08 1762117688

Just for reference, the main author's stance on God: https://youtu.be/k_VBzweMIlM?t=125