tadala's comments

tadala · 2024-10-04T10:26:03 1728037563

Everyone wants to use less compute to fit more in, but (obviously?) the solution will be to use more compute and fit less. Attention isn't (topologically) attentive enough. All these RNN-lite approaches are doomed, beyond saving costs, they're going to get cooked by some other arch—even more expensive than transformers.

falcor84 · 2024-10-04T10:37:08 1728038228

Would you mind expanding upon your thesis? If that compute and all those parameters aren't "fitting" the training examples, what is it that the model is learning, and how should that be analyzed?

ithkuil · 2024-10-04T12:06:51 1728043611

I think there are two distinct areas. One is the building of the representations, which is achieved by fitting. The other area is loosely defined as "computing" which is some kind of searching for a path through representation space. All of that is wrapped in a translation layer that can turn those representations into stuff we humans can understand and interact with. All of that is achieved to some extent by current transformer architectures, but I guess some believe that they are not quite as effective at the "computation/search" stage.

falcor84 · 2024-10-04T14:35:12 1728052512

But how does it get good at "computing"? The way I see it, we either program them to do so manually, or we use ML, at which case the model "fits" the computation based on training examples or environmental feedback, no? What am I missing?

ithkuil · 2024-10-04T14:55:44 1728053744

the distinction is fuzzy indeed, especially if any thing that you "program in manually" has some parameters that are learned.

Conceptually we already have parts of the model that are not learned: the architecture of the model itself.

tadala · 2024-06-20T00:34:58 1718843698

Is this a popular view? I think mathematicians can be odd, but usually they communicate quite well. I think as far as popularization of their fields go, mathematics is probably doing the best out of the lot: numberphile, 3blue1brown etc.

xanderlewis · 2024-06-20T01:32:05 1718847125

The examples you list are not known as mathematicians; they’re popularisers who (sometimes) happen to have qualifications and a history of studying the subject. 3B1B is absolutely brilliant but Grant Sanderson is not a ‘mathematician’ in the sense of someone who does research in mathematics.

Ironically, the fact that mathematics popularisation is as visible as it is is itself a sign of how much it is needed and therefore how unpopular and misunderstood the subject is. Branches of science like, say, astrophysics don’t need popularisation; people already think they’re cool.

The view of ‘people who are good at mathematics’ being bad at English is a relatively common one, in my experience. At least at the level of university students. People think there’s some sort of conservation of ability or equilibrium in the universe that means that if you have a ‘maths brain’ then you’re no good at much else, and vice versa. If anything, I think there’s a positive correlation between mathematical and communication ability — after all, mathematics is basically just the science of clever notation and clear-headed thinking.

tadala · 2024-06-10T22:40:10 1718059210

You could fill a form and request them not to train; they usually approved it fairly quickly, but did not advertise it well enough!

tadala · 2024-06-02T18:02:36 1717351356

Shocking comment. Do you think the scientific method is inbuilt in our DNA or something? Where do you think it all comes from?

bbarnett · 2024-06-02T18:12:13 1717351933

You think it comes from philosophy? This is like claiming our democracy comes from God, because he edictorially spoke through ancient kings!

djur · 2024-06-02T18:18:45 1717352325

The term "scientific method" is itself a philosophical term (as is "method" in this context). Read, or even skim, this and notice how many of the important figures listed were philosophers:

https://en.wikipedia.org/wiki/History_of_scientific_method

tadala · 2024-06-02T18:01:33 1717351293

On the other hand, the other response to OP's comment is a perfect display of what he means. A lot of tech bro hubris/idiocy.

ryukoposting · 2024-06-02T19:11:57 1717355517

In that case, let me go a step further: although I wouldn't respond the way some other folks have, I get why they would. Many of my most memorable and most intellectually stimulating classes were those that weren't related to my engineering degree. The philosophy classes, though, never even approached "intellectually stimulating" status. I wrote a good 80-100 pages of pseudointellectual drivel about half-baked analogies like the "answering machine paradox," and accrued thousands in debt in the process.

Another thing: The great thing about Philosophy is that there are no wrong answers. But, the bad thing about philosophy classes is there are wrong answers. Open-endedness and free thinking don't scale to 150-seat lecture halls, indifferent TAs, and PhD-candidate "professors" doing the bare minimum to get a diploma.

tadala · 2024-05-28T11:02:04 1716894124

Not within mathematics, where it is the entire sport, and which is the point of contention.

psychoslave · 2024-05-28T12:50:40 1716900640

If there is one space where it shines, sure it’s mathematics. But even there, the most notable mathematicians highly rely on some intuitions far before they manage to prove anything, as well as while selecting/creating their conceptual tools to attempt to build the proof, and rarely go to the point of formalizing their points through Coq/Isabelle or even with meticulous paper craft à la Principia Mathematica from Russel and Whitehead.

calf · 2024-05-29T16:59:57 1717001997

Except humans correctly believe that a Coq proof is theoretically correct whereas an LLM does not have this meta reasoning ability at all.

tadala · 2024-04-25T14:10:14 1714054214

Ah the nature vs nurture debate, we meet again!

Give me a Neural Net in its first epoch and I shall mold it into anything!

tadala · 2024-04-22T18:03:55 1713809035

Not a typo but an incorrect use of that word.