Hacker News new | past | comments | ask | show | jobs | submit | ai-christianson's comments login

Seems like an ideal compression method for LoRa/Meshtastic-style communication. An LLM wouldn't run on an ESP32, but there are several that could run on a raspberry pi.

It's not just natural language that could be compressed this way, either. Code (HTML, JS, etc) could be compressed with the same technique/models. I bet that the same general idea could work for image compression as well, using an image/diffusion model (or perhaps a multimodal model for everything.)

This could lead to an entire internet of content using just a few bits.


The key insight is that the larger the shared context between parties, the more efficient communication can be, as communication tends towards a purely relational construct. The limit of this is two parties that both share the exact same context and inputs, the inputs should produce the same hidden state within both parties and communication is not even necessary because both parties have the same knowledge and state.

That's not new to anyone familiar with compression or information theory, but the novelty here is the LLM itself. It's absolutely plausible that, given an already highly compressed relationally-encoded context like a trained LLM, very few bits could be communicated to communicate very abstract and complex ideas, letting the LLM recontextualize information which has been compressed across several semantic and contextual layers, effectively leveraging a complete (but lossy) history of human knowledge against every single bit of information communicated.


LORA is also pretty slow, like the 'long fast' mode that most meshtastic users use is about a kilobit per second... and presumably a small percentage of the traffic at any time is traffic in channels that you're monitoring.

Probably decoding few tokens per second is fast enough to deliver more goodput than the existing uncompressed usage.


Would you consider giving these crawlers access if they paid you?

Interesting idea, though I doubt they'd ever offer a reasonable amount for it. But doesn't it also change a sites legal stance if you're now selling your users content/data? I think it would also repel a number of users away from your service

At this point, no.

No, because the price they'd offer would be insultingly low. The only way to get a good price is to take them to court for prior IP theft (as NYT and others have done), and get lawyers involved to work out a licensing deal.

This is one of the few interesting uses of crypto transactions at reasonable scale in the real world.

What mechanism would make it possible to enforce non-paywalled, non-authenticated access to public web pages? This is a classic "problem of the commons" type of issue.

The AI companies are signing deals with large media and publishing companies to get access to data without the threat of legal action. But nobody is going to voluntarily make deals with millions of personal blogs, vintage car forums, local book clubs, etc. and setup a micro payment system.

Any attempt to force some kind of micro payment or "prove you are not a robot" system will add a lot of friction for actual users and will be easily circumvented. If you are LinkedIn and you can devote a large portion of your R&D budget on this, you can maybe get it to work. But if you're running a blog on stamp collecting, you probably will not.


Use the ex-hype to kill the new hype?

And the ex-hype would probably fail at that, too :-)


What does crypto add here that can't be accomplished with regular payments?

I think they're real. The model is performing better than claude-3-5-sonnet-20241022 on the claude leaderboard:

https://aider.chat/docs/leaderboards/


It was a response to the approach of carefully planning and specifying every aspect of the software up-front.

The problem with that "waterfall" approach is that, in the real world, we often work in fast moving/evolving problem domains, or domains where we discover more as we build. Changing requirements are disastrous to the waterfall approach and can kill projects easily.

If we're to be pragmatic at all, we need some flexibility/adaptability in the process.


> I certainly see a point when people switch to XFCE from Gnome or KDE. But what about if coming from the other side of the spectrum?

I think you basically answered it right there.

It sits between. For the lxde people it would be more of a full fledged desktop, but not with the full weight of KDE or Gnome.


Thanks!

Yeah, I understand that it sort of sits in between. But what are those "additional" features compared to a more spartan DEs? More included apps / accessories? More access to the machine's properties (like, more GUIs for various options)? Something else?


I have not tried LXQt properly or recently but I think both those and lots of applets.


There's a lot of nuance in how AI is used vs. the fact that it is used at all. Which specific AI tool is used matters a lot as well.

For example, LLMs can generate a ton of slop, whether in writing form, code, or anything else. But if used thoughtfully, they can lead to higher quality code bases with less tech debt and allow higher quality products to be built faster.

If someone is using the right tools for the right reasons, I see it as a good thing.


Is this worth going through the effort of hosting?


Tried it out. The ideas seemed pretty good. The downfall was the names.

Has anyone else noticed that? The AI (in general, not just this product) seems really bad at coming up with names for anything.


Thanks for trying it out, and glad to hear the idea generation is working well

You are absolutely right, those names are often a miss, very surface level or just 2 words glued together.

I guess that will be my next challenge to solve, thanks!


Are you finding building MCP integrations to be worth it? We've been using agents (e.g. langchain), which are pretty good at bringing in context and taking actions. Tool results become part of the context --it just works.


Good thing to me (besides being an open spec) is their simplicity, with libraries such as FastMCP you can just bring stuff you already have implemented into Claude (or any MCP client).


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: