The original statement stands, if what you are suggesting in addition to it is true. If the initial one-time investment of $505m is enough to distill new SOTA models for $0.50 a piece, then the average cost for subsequent models will trend toward $0.50.
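The amortization argument above is just fixed cost divided by volume; a minimal sketch, using the comment's hypothetical $505M / $0.50 figures:

```python
# Hypothetical numbers from the comment: a one-time $505M investment,
# then $0.50 of marginal cost per distilled model.
FIXED_COST = 505_000_000  # one-time investment (USD)
MARGINAL_COST = 0.50      # cost per distilled model (USD)

def average_cost(n_models: int) -> float:
    """Average cost per model after distilling n_models of them."""
    return (FIXED_COST + MARGINAL_COST * n_models) / n_models

for n in (1_000, 1_000_000, 1_000_000_000):
    print(f"{n:>13,} models -> ${average_cost(n):,.2f} each")
```

The average only "trends toward $0.50" in the limit; at a million models it is still dominated by the fixed cost (~$505.50 each).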
This is how I think about unlimited data plans haha. I think a rate limit is easier to stomach (e.g., X requests per hour where they bank up to one hour so you can burst up to 2X for especially crazy hours or something).
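The "X per hour, bank up to one hour, burst to 2X" scheme is a token bucket with refill rate X/hour and capacity 2X. A minimal sketch (class and parameter names are my own, not from any real API):

```python
import time

class BankedRateLimiter:
    """Token bucket matching the scheme above: tokens refill at
    per_hour/hour, and up to one extra hour's worth can be banked,
    so a burst can reach 2 * per_hour."""

    def __init__(self, per_hour: int, now=time.monotonic):
        self.rate = per_hour / 3600.0   # tokens per second
        self.capacity = 2 * per_hour    # current hour + one banked hour
        self.tokens = float(per_hour)   # start with one hour's allowance
        self.now = now                  # injectable clock for testing
        self.last = now()

    def allow(self) -> bool:
        """Spend one token if available; refill based on elapsed time."""
        t = self.now()
        self.tokens = min(self.capacity,
                          self.tokens + (t - self.last) * self.rate)
        self.last = t
        if self.tokens >= 1.0:
            self.tokens -= 1.0
            return True
        return False
```

The `min(capacity, ...)` line is what caps the bank at one hour: idle longer than two hours and the extra allowance is simply lost.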
Transformers are deep feedforward networks that happen to also have attention. Causal LMs are heavily memory bound during inference: with KV caching, each decode step processes only a single new token, so the weights of every linear layer must be streamed from memory just to transform one token per step.
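A back-of-envelope calculation shows why streaming every weight per token is memory bound. The model size and hardware figures below are illustrative assumptions (a 7B-parameter model in fp16, and rough published H100 specs), not measurements:

```python
# Why single-stream (batch-1) decoding is memory bound, roughly.
# Assumed model: 7B parameters stored in fp16 (2 bytes each).
params = 7e9
bytes_per_param = 2

# Each decode step streams every weight once to emit one token.
bytes_moved = params * bytes_per_param   # ~14 GB of traffic per token
flops = 2 * params                       # ~1 multiply + 1 add per weight

intensity = flops / bytes_moved          # FLOPs per byte of traffic
print(f"arithmetic intensity ~ {intensity:.1f} FLOP/byte")

# For comparison: an H100 delivers on the order of hundreds of dense
# fp16 TFLOPS against ~3.35 TB/s of bandwidth, so it needs on the
# order of ~300 FLOP/byte to be compute bound. Batch-1 decoding sits
# near 1 FLOP/byte, i.e. the cores mostly wait on memory.
```

Batching fixes this by reusing each loaded weight across many sequences, which is why serving throughput improves so dramatically with batch size.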
For me complete privacy is a must-have for an LLM that gets access to pretty much all my data (mails, calendar, location, browser history, chats, address book, health, app use, ...).
But there are other benefits, such as availability even when your phone is offline, lower latency, and no per-use cost.
NYC's defining trait and constant throughout history is development. The concept of tiny row houses with a car in every driveway in Queens was invented in the mid-20th century, when Robert Moses scarred half a dozen neighborhoods to make it that way.