They stopped the one child policy nine years ago; they had it in the first place because demographic projections had them severely overpopulated if they didn't.
During the period in which it was active, their GDP increased by a factor of about 62. Not percentage, multiple.
They reversed the policy too late. Their rapid growth came from seizing the low-hanging fruit: upgrading a country of almost entirely peasants into one of slightly fewer peasants, all funded by Western greed. Those gains will never happen again.
Entrenched, well-connected players tend to exit industries before the gavel falls, and enforcement from the CCDI isn't impartial; there's plenty of bribery to stay off its radar.
Truly Schumpeterian creative destruction is good in a vacuum, but reality isn't a vacuum.
There is no such thing as a "best possible model, full stop". Models are always context-dependent: they carry implicit or explicit assumptions about what is signal and what is noise, and they have different performance characteristics in training and at inference. Choosing the "best" model for your task is itself a form of hyperparameter optimization.
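A minimal sketch of that idea: treat the model family itself as a hyperparameter and pick it on held-out data. Everything here (the synthetic linear data, the two toy candidates) is made up for illustration.

```python
import random

random.seed(0)

# Synthetic data: a noisy linear trend (illustrative only).
xs = [i / 10 for i in range(100)]
ys = [2.0 * x + random.gauss(0, 0.5) for x in xs]

# Holdout split.
train_x, train_y = xs[::2], ys[::2]
val_x, val_y = xs[1::2], ys[1::2]

def fit_mean(x, y):
    # Baseline: always predict the training mean.
    m = sum(y) / len(y)
    return lambda _: m

def fit_linear(x, y):
    # Ordinary least squares for y = a*x + b.
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    a = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / \
        sum((xi - mx) ** 2 for xi in x)
    return lambda xi: a * xi + (my - a * mx)

def mse(model, x, y):
    return sum((model(xi) - yi) ** 2 for xi, yi in zip(x, y)) / len(x)

# "Which model family?" is searched on validation data,
# exactly like any other hyperparameter.
candidates = {"mean": fit_mean, "linear": fit_linear}
scores = {name: mse(fit(train_x, train_y), val_x, val_y)
          for name, fit in candidates.items()}
best = min(scores, key=scores.get)
print(best, scores)
```

Swap in real model families (GBTs, NNs, ...) and a cross-validation loop and the structure is the same.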
Plenty of places use DL models, even if only as one component of their stack. I would guess that gradient-boosted trees are more common in applications, though.
Still mostly NLP and image stuff. Most actual data in the wild is tabular, where GBTs are usually some combination of better-performing and easier to use. In some circumstances, NNs can still work well on tabular problems with the right feature engineering or model stacking.
NNs are also more attractive for streaming data, since tree ensembles can't easily learn incrementally; they have to be retrained from scratch each time.
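To make the contrast concrete, here is a toy incremental learner: a linear model doing one SGD step per incoming event, so each update is constant work. The stream and its 3x+1 relationship are made-up assumptions; a tree ensemble absorbing the same stream would refit over the accumulated history.

```python
import random

random.seed(1)

class OnlineLinear:
    """Incremental linear model: one SGD step per sample, no retraining."""
    def __init__(self, lr=0.05):
        self.w, self.b, self.lr = 0.0, 0.0, lr

    def predict(self, x):
        return self.w * x + self.b

    def update(self, x, y):
        err = self.predict(x) - y     # squared-error gradient
        self.w -= self.lr * err * x
        self.b -= self.lr * err

model = OnlineLinear()
for _ in range(5000):                 # simulated stream: y = 3x + 1 + noise
    x = random.uniform(0, 1)
    y = 3.0 * x + 1.0 + random.gauss(0, 0.1)
    model.update(x, y)                # constant work per event

print(round(model.w, 1), round(model.b, 1))
```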
ML is very good at figuring out patterns like: every day at 22:00 this asset goes up, if another asset is not at a daily maximum and market volatility is low.
You might call this overfitting/noise/.... but if you do it carefully it's profitable.
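"Doing it carefully" mostly means checking mined rules out-of-sample. A toy sketch: hourly returns with a small real edge deliberately injected at hour 22 (all numbers are invented), mine the best hour in-sample, then insist it survives on fresh data before trusting it.

```python
import random

random.seed(2)

def make_returns(days):
    # Hourly returns; a genuine small edge is injected at hour 22.
    data = []
    for _ in range(days):
        for h in range(24):
            edge = 0.05 if h == 22 else 0.0
            data.append((h, edge + random.gauss(0, 0.02)))
    return data

train = make_returns(60)
test = make_returns(60)

def mean_by_hour(data):
    sums, counts = [0.0] * 24, [0] * 24
    for h, r in data:
        sums[h] += r
        counts[h] += 1
    return [s / c for s, c in zip(sums, counts)]

# Mine the rule in-sample...
train_means = mean_by_hour(train)
best_hour = max(range(24), key=lambda h: train_means[h])

# ...then validate it on data the search never saw.
oos_mean = mean_by_hour(test)[best_hour]
print(best_hour, round(oos_mean, 3))
```

With 24 candidate hours the in-sample winner can easily be noise; the out-of-sample check is what separates a real pattern from overfitting.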
Real-time parsing of incoming news events and live scanning of internet news sites - coupled with sentiment analysis. Latency is an interesting challenge in that space.
Multiple parts of the iPhone stack run DL models locally on your phone. They even added hardware acceleration to the camera pipeline because most of the picture-quality gains come from software rather than hardware.
I guess the OP may be envisioning an end-to-end solution that can train a model in the context of an external document store.
I.e., one day we want to be able to backprop through the database.
Search systems face equivalent problems. The stages in the hierarchy of an ML retrieval system are separately optimized (trained). Maybe this helps regularize things, but, given enough compute and complexity, it is theoretically possible to differentiate through more of the stack.
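A toy illustration of differentiating through a retrieval step: a retriever scores documents, a softmax relaxes the discrete top-1 selection so the whole stack is differentiable, and the downstream loss sends gradient back into the retriever weight. Finite differences stand in for real autodiff here, and the one-feature "documents" and target are made up.

```python
import math

docs = [0.2, 0.9, 0.4]   # each "document" reduced to a single feature
target = 0.9             # what the full stack should output

def loss(w):
    scores = [w * d for d in docs]                 # retriever scores
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    probs = [x / z for x in exps]                  # soft top-1 over docs
    out = sum(p * d for p, d in zip(probs, docs))  # soft-retrieved value
    return (out - target) ** 2                     # downstream task loss

def grad(w, eps=1e-5):
    # Numerical gradient; a stand-in for backprop through the stack.
    return (loss(w + eps) - loss(w - eps)) / (2 * eps)

w = 0.0
for _ in range(300):
    w -= 20.0 * grad(w)   # end-to-end update of the retriever parameter
```

The point is only that the selection step, once relaxed, passes useful gradient to the retriever; real systems use learned rerankers and hard top-k with tricks like straight-through estimators instead.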
Not at all. The authority figure simply could not live in a world where the question was wrong. A better response, in his role as a school teacher, would have been to tackle the question in depth as a class, or otherwise show me how I was mistaken. But since I was immediately dismissed, I'll never know for sure.
The Fourier coefficient F(w) is the (complex) dot product of f with an exponential basis function, `e_w • f`, and is in that sense a projection. The inverse Fourier transform writes the original function as a sum of the projected components: `f = sum_w (e_w • f) e_w = sum_w F(w) e_w`. This is exactly how writing an "arrow"-style 2- or 3-D vector as a sum of orthogonal projections works.
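This is easy to check numerically for the discrete case. A sketch with N = 8 made-up samples: the DFT coefficients are complex dot products with orthonormal exponentials, and summing the projections recovers f exactly (sign/normalization conventions vary; this one makes each `e_w` unit length).

```python
import cmath

N = 8
f = [0.0, 1.0, 2.0, 1.0, 0.0, -1.0, -2.0, -1.0]

def e(w):
    # Orthonormal basis vector e_w (unit length under the complex dot product).
    return [cmath.exp(2j * cmath.pi * w * n / N) / cmath.sqrt(N)
            for n in range(N)]

def dot(a, b):
    # Complex dot product <a, b> = sum_n a_n * conj(b_n).
    return sum(x * y.conjugate() for x, y in zip(a, b))

# F(w) = e_w . f : the projection coefficients.
F = [dot(f, e(w)) for w in range(N)]

# f = sum_w F(w) e_w : the sum of orthogonal projections recovers f.
recon = [sum(F[w] * e(w)[n] for w in range(N)) for n in range(N)]
```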
http://proceedings.mlr.press/v37/sohl-dickstein15.pdf