More

kadushka · 2025-03-04T03:04:12 1741057452

That question sounds to me like “have any of you switched away from myspace?”

I last used dropbox ~15 years ago. Since then I used S3, Google Drive, Synology NAS, and iCloud. I don’t need to do any syncing across devices: if I need a file I know where to find it (gmail if it’s a document, icloud if it’s an image/video). I still back everything up to Synology.

nopelynopington · 2025-03-04T13:36:28 1741095388

Synology, thanks I'll take a look

kadushka · 2025-03-04T02:47:32 1741056452

What do you mean? We want images and text to live in the same latent space, and be represented by similar vectors if the two correlate. How else would you want to do it?

kadushka · 2025-03-03T17:19:47 1741022387

Not yet. It will probably start later this year: instead of hiring a junior swe, companies will buy gpt-5 or claude-4 subscriptions and ask senior engineers to deliver more.

Mass layoffs are still 2-3 years away, but I’m expecting this time next year the team I’m on will shrink because of AI.

kadushka · 2025-03-03T03:18:21 1740971901

I wouldn't be great at finding bugs in it or logical flaws

This is what tests are for.

nradov · 2025-03-03T05:49:23 1740980963

You can't test quality into a product.

notepad0x90 · 2025-03-03T05:38:00 1740980280

The tests are probably LLM generated as well lol

kadushka · 2025-03-01T20:27:03 1740860823

So this drug would mess with my head – am I understanding it right?

KaoruAoiShiho · 2025-03-01T20:41:09 1740861669

Everything messes with your head, this adjusts the mess levels.

bozhark · 2025-03-01T20:53:39 1740862419

Maybe gut? Lot’s of serotonin receptors there, right?

kadushka · 2025-03-01T03:23:01 1740799381

I just asked gpt-4o:

“9.9 is larger than 9.11.

This is because 9.9 is equivalent to 9.90, and comparing 9.90 to 9.11, it’s clear that 90 is greater than 11 in the decimal place.”

kadushka · 2025-03-01T02:32:33 1740796353

Ironically, Soviet Union was the product of the far left ideology.

kadushka · 2025-02-28T06:26:16 1740723976

People could tell scaling wasn't working well before the release of GPT 4.5

Who could tell? Who has tried scaling up to this level?

throwaway314155 · 2025-02-28T07:03:09 1740726189

https://www.reuters.com/technology/artificial-intelligence/o...

> Ilya Sutskever, co-founder of AI labs Safe Superintelligence (SSI) and OpenAI, told Reuters recently that results from scaling up pre-training - the phase of training an AI model that use s a vast amount of unlabeled data to understand language patterns and structures - have plateaued.

anshumankmr · 2025-02-28T08:25:40 1740731140

OpenAI took a bullet for the team, by perhaps scaling the model to something bigger than the 1.6T params GPT4 possibly had and basically telling its competitors its not gonna be worth scaling much beyond those number of params in GPT4, without a change in the model architecture

kadushka · 2025-02-24T21:54:55 1740434095

As opposed to vector search, or…?

FeepingCreature · 2025-02-24T22:54:57 1740437697

To my knowledge these are the options:

1. RAG: A simple model looks at the question, pulls up some associated data into the context and hopes that it helps.

2. Self-RAG: The model "intentionally"/agentically triggers a lookup for some topic. This can be via a traditional RAG or just string search, ie. grep.

3. Full Context: Just jam everything in the context window. The model uses its attention mechanism to pick out the parts it needs. Best but most expensive of the three, especially with repeated queries.

Aider uses kind of a hybrid of 2 and 3: you specify files that go in the context, but Aider also uses Tree-Sitter to get a map of the entire codebase, ie. function headers, class definitions etc., that is provided in full. On that basis, the model can then request additional files to be added to the context.

kadushka · 2025-02-25T00:43:31 1740444211

I'm still not sure I get the difference between 1 and 2. What is "pulls up some associated data into the context" vs ""intentionally"/agentically triggers a lookup for some topic"?

throwaway314155 · 2025-02-25T03:58:38 1740455918

1. Tends to use embeddings with a similarity search. Sometimes called "retrieval". This is faster but similarity search doesn't alway work quite as well as you might want it to.

2. Instead lets the agent decide what to bring into context by using tools on the codebase. Since the tools used are fast enough, this gives you effectively "verified answers" so long as the agent didn't screw up its inputs to the tool (which will happen, most likely).

numba888 · 2025-02-24T23:59:12 1740441552

Does it make sense to use vector search for code? It's more for vague texts. In the code relevant parts can be found by exact name match. (in most cases. both methods aren't exclusive)

simonw · 2025-02-25T00:40:59 1740444059

Vector search for code can be quite interesting - I've used it for things like "find me code that downloads stuff" and it's worked well. I think text search is usually better for code though.

kadushka · 2025-02-23T20:00:48 1740340848

I'm still reading the paper, but my main question is how slow is the model compared to LLM of the same size. It seems like to get the best accuracy they need to set number of time steps to the number of tokens to be generated. Does it make it comparable in speed to an LLM?

kadushka · 2025-02-25T04:23:20 1740457400

Update: finished the paper, and as I suspected, there's a serious downside in speed and memory consumption. LLaDA model has to process the entire output sequence on every time step - without anything like KV cache. Also, full quadratic attention happens on the entire output sequence on every time step, which makes it unfeasible for sequence length longer than a few thousand tokens.