Hacker Newsnew | past | comments | ask | show | jobs | submit | voxgen's commentslogin

This could be an explanation for the drama - LLMs are trained to learn and emulate correlations in text.

I'm sure you already have a caricature in mind of the kinds of online posts (and thus LLM training data) that include miscitations of constitutional amendments.


It's not perfect but it does have a few opt-in security features: running all tools in a docker container with minimal mounts, requiring approvals for exec commands, specifying tools on an agent by agent basis so that the web agent can't see files and the files agent can't see the web, etc.

That said, I still don't trust it and have it quarantined in a VPS. It's still surprisingly useful even though it doesn't have access to anything that I value. Tell it to do something and it'll find a way!


I'm working in AI, but I'd have made this anyway: Molty is my language learning accountability buddy. It crawls the web with a sandboxed subagent to find me interesting stuff to read in French and Japanese. It makes Anki flashcards for me. And it wraps it up by quizzing me on the day's reading in the evening.

All this is running on a cheap VPS, where the worst it has access to is the LLM and Discord API keys and AnkiWeb login.


Ratio/quantity is important, but quality is even more so.

In recent LLMs, filtered internet text is at the low end of the quality spectrum. The higher end is curated scientific papers, synthetic and rephrased text, RLHF conversations, reasoning CoTs, etc. English/Chinese/Python/JavaScript dominate here.

The issue is that when there's a difference in training data quality between languages, LLMs likely associate that difference with the languages if not explicitly compensated for.

IMO it would be far more impactful to generate and publish high-quality data for minority languages for current model trainers, than to train new models that are simply enriched with a higher percentage of low-quality internet scrapings for the languages.


It requires tax increases, and the average earner's UBI will typically balance out the tax increase, meaning they don't directly profit.

UBI isn't about giving everyone free money. It's about giving everyone a safety net, so that they can take bigger economic risks and aren't pushed into crime or bullshit work.

The upper half of society will only see the indirect benefits, like having greater employment/investment choices due to more entrepreneurialism.


> due to more entrepreneurialism.

We have three awesome control groups currently:

(1) Retirees with skills don't suddenly decide to become entrepreneurs when they reach 65.

(2) people on the dole don't suddenly become entrepreneurs. We even used to have a specific programme in New Zealand for the unemployed to start their own business . . . I'm fairly sure it didn't work.

(3) mothers on the DPB get a good whack of money even with kids that don't need Hyde time investment. It is rare to see them do anything more entrepreneurial than an under-the-table job.

I love your optimism, but it isn't realistic.


> It requires tax increases, and the average earner's UBI will typically balance out the tax increase, meaning they don't directly profit.

A good portion of my salary is already taken by Tax and the government wastes it. I've seen the waste first hand when contracting for both Local, Nation Government. I was so disgusted by this, I have made every effort to avoid working with them.

I've also seen this waste happen in large charities and ossified corporations. The former also disgusting me as I know they would simply piss away a few thousand on complete BS, that took a whole village to collect and for it not to go towards the stated purpose of the charity. As a result I don't donate to any charities that aren't local.

Every-time someone suggests a tax increase, I know for a fact they haven't seen the waste happen first hand.

> UBI isn't about giving everyone free money. It's about giving everyone a safety net, so that they can take bigger economic risks and aren't pushed into crime or bullshit work.

Giving everyone a safety net will require giving people money that is taken from others. To the people that benefit it is seen as "free" and will become "expected" and won't be treated as a safety net.

Being a responsible adult is about reducing the amount of risk you are taking, not increasing it.

So what you will be doing it teaching people to essentially gamble and people did similar during COVID. Some people took their cheques and put it into crypto, meme stocks or whatever. Some won big, most didn't.

I've met people in my local area that have lost huge amounts of money on risky investments, everything from property developments, to bitcoin. Creating an incentive for risk taking without the consequences is actually reckless, a massive moral hazard and will simply create perverse incentives.

> The upper half of society will only see the indirect benefits, like having greater employment/investment choices due to more entrepreneurialism.

You will be taxing those people more and they will have less to invest. The reason why many people invest is because they have disposable income that they can afford to risk.

By taxing people more (which you admit would have to happen), they will have less disposable income and will be inclined to invest less as a result.


That discussion also makes me worry that they may try to use LLMs or LLM-based metrics to measure the size of the gap as a proxy for value of the content.

The landlord of the marketplace should probably not dabble in the appraisal of products, whether for factuality or value.


> without punishing regular browsing humans.

As a content consumer, I'm also hoping to be part of the ecosystem. I already use Patreon a lot as "AdBlock absolution", but it doesn't fix the market dynamics. Major content platforms tend to stagnate or worsen over time, because they prefer to sell impressions to advertisers than a good product to consumers.


What makes you think the secrets are small enough to fit inside people's heads, and aren't like a huge codebase of data scraping and filtering pipelines, or a DB of manual labels?


Please consider also describing the business model on the website, even if hidden away on a FAQ. I've so much subscription fatigue now, I just don't try things out if needing a subscription is an inevitability. I'm happy to pay for good products, just not happy to be forced to pay a fixed rate for continued access even if my usage dwindles.

If you are thinking of adding a one-off-donation-style purchase method, consider giving annual reminders to renew it. At least in my case, I'm not unwilling to pay repeatedly if development continues, just unwilling to make an upfront ongoing commitment.


I don't think retrofitting existing languages/ecosystems is necessarily a lost cause. Static enforcement requires rewrites, but runtime enforcement gets you most of the benefit at a much lower cost.

As long as all library code is compiled/run from source, a compiler/runtime can replace system calls with wrappers that check caller-specific permissions, and it can refuse to compile or insert runtime panics if the language's escape hatches would be used. It can be as safe as the language is safe, so long as you're ok with panics when the rules are broken.

It'd take some work to document and distribute capability profiles for libraries that don't care to support it, but a similar effort was proven possible with TypeScript.


I actually started working on a tool like that for fun, at each syscall it would walk back up the stack and check which shared object a function was from and compare that to a policy until it found something explicitly allowed or denied. I don't think it would necessarily be bulletproof enough to trust fully but it was fun to write.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: