ArVID220u's comments

ArVID220u · 2025-01-14T03:26:53 1736825213

cursor dev here. reasonable assumptions, but not quite the case. the snyk packages are just the names of our bundled extensions, which we never package nor upload to any registry. (we do it just like how VS Code does it: https://github.com/microsoft/vscode/tree/main/extensions)

we did not hire snyk, but we reached out to them after seeing this and they apologized. we did not get any confirmation of what exactly they were trying to do here (but i think your explanation that someone there suspected a dependency confusion vulnerability is plausible. though it's pretty irresponsible imo to do that on public npm and actually sending up the env variables)

nomilk · 2025-01-14T04:56:33 1736830593

> "pretty irresponsible"

Wouldn't it be more like "pretty illegal"? They could have simply used body: JSON.stringify("worked"), i.e. not sent target machines’ actual environment variables, including keys.

reubenmorais · 2025-01-14T07:56:05 1736841365

It's an unfortunate incentive structure. If you're doing offensive security research, there's two ways you can go about it: you can report the potential vulnerability without exploiting it, in which case you risk the company coming back to you and saying "thanks but we don't consider this a vulnerability because it's only exploited through misconfiguration and we're too smart for that". Maybe you get some token reward of $50.

Or you can exploit it and say here's the PoC, this many people at your company fell for it, and this is some of the valuable data I got, including some tokens you'll have to rotate. This puts you into actual bug bounty territory. Certainly the PR side of things alone will incentivize them to pay you so you don't make too much of a noise about how Cursor leaked a bunch of credentials due to a misconfiguration that surely every good programmer knows about and defends against (like so many vulnerabilities seem so dumb in hindsight).

xmodem · 2025-01-14T11:45:42 1736855142

Cursor does not have a bug bounty though, and its hard to see how this constitutes anything other than a direct attack on them, their users, or both. "The incentive structure made me do it" does not justify acting like a criminal.

grajaganDev · 2025-01-14T13:22:33 1736860953

Cursor asks researchers to report vulnerabilities to their GitHub security page.

The same incentive to show impact applies even without a paid bounty.

px43 · 2025-01-14T13:12:26 1736860346

> Cursor does not have a bug bounty

Shouldn't this alone be considered criminal negligence at this point? Cursor isn't some random open source project. It's a company that has funding, and subscriptions. Hell, I pay Cursor for a monthly subscription. Pretty incredible that they have no bounty program.

britannio · 2025-01-14T13:35:09 1736861709

The lack of a bug bounty program doesn't prohibit them from rewarding reported vulnerabilities.

taskforcegemini · 2025-01-14T17:19:12 1736875152

do they though?

miki123211 · 2025-01-14T10:43:18 1736851398

You can also console.log those credentials as a PoC, and then show that the console.log could trivially be replaced by a fetch().

Kind of like a lot of exploit PoCs just "pop a calc" (AKA open the Calculator app), not because opening the calculator is valuable to an attacker, but because if you can open calculator, you can do anything.

objectified · 2025-01-14T13:00:25 1736859625

The problem there though, is that with PoCs like this, as an attacker you want to have a ping back to your system so that you know the attack has been successful (in this case they probably expected/hoped someone at Cursor to install the package, that's the usual objective in a dependency confusion attack). But what they could have done, is send a less sensitive thing like just the current working directory or current effective user, instead of the whole environment.

fkyoureadthedoc · 2025-01-14T14:50:08 1736866208

What actually changes though in your scenario? Potential bad actor gets RCE on your dev machines, it doesn't really matter what they sent home, you're rotating keys and doing your due diligence either way.

mistercow · 2025-01-14T09:14:50 1736846090

I wonder how viable it would be to find a public key your target owns and use it to encrypt the data you send back. Then you could prove to them that you exfiltrated real data without exposing it to anyone outside the company.

Alternatively, you could hash it and say “Look, it’s a sha of your database password hyphen “yougotpwnd””

necovek · 2025-01-14T11:59:56 1736855996

HTTPS certificates should already have that public key for you, so it should be trivial.

Szpadel · 2025-01-14T12:03:16 1736856196

wouldn't capturing only env names without values be ideal middle ground?

look we had access to your Aws tokens, we could take over your account but we didn't steal actual token, we just got proof that we could access it

reubenmorais · 2025-01-14T17:37:44 1736876264

Yes I agree names only would have been a better approach here.

6mile · 2025-01-14T08:27:17 1736843237

Yeah, I agree the incentive structure is broken for bug bounty hunters. Until the BB platforms themselves create some rules for their customers and researchers, we are gonna continue to have the sh*t show that we do now. The reality is that bug bounty hunters are deploying a significant percentage of the total malicious NPM packages each month.

mcherm · 2025-01-14T09:42:30 1736847750

> The reality is that bug bounty hunters are deploying a significant percentage of the total malicious NPM packages each month.

I don't actually think that is a bad thing.

The TSA screening at airports would be vastly better if TSA maintained a "red team" that regularly tried smuggling guns (or water bottles or whatever) into airports. The agents would be more attentive if the number of incidents they dealt with was large enough that they could practice more often. The system could improve if it had actual feedback on how accurate and effective it was. And instead of agents overreacting or underreacting they could tune their responses to an appropriate level.

The same applies to supply chain attacks. The REAL ones are rare, dangerous, and performed by experts; having a chance to practice catching them, to assess our detection rates, and to adjust our reactions is healthy.

dagw · 2025-01-14T09:46:55 1736848015

The TSA screening at airports would be vastly better if TSA maintained a "red team" that regularly tried smuggling guns

They actually do have this. TSA seem to still suck at their job:

https://www.forbes.com/sites/michaelgoldstein/2017/11/09/tsa...

https://www.gao.gov/products/gao-19-374

LtWorf · 2025-01-14T16:47:57 1736873277

You'd also suck if you knew your job is useless busywork.

whimsicalism · 2025-01-14T16:02:59 1736870579

a prerequisite of “offensive security research” is that it is solicited, no ifs or buts.

what they did was absolutely wrong and frankly likely illegal

peanut-walrus · 2025-01-14T09:14:31 1736846071

Yeah, it sucks, but that's the way it is. It is super common for bug bounty findings to be ignored or downgraded unless you show actual code exec on their machines or dump some of their creds.

DigitalNoumena · 2025-01-14T10:29:02 1736850542

It may interest you that Guy Podjarny, one of the Snyk founders, now has an AI coding company (https://www.tessl.io/about) that looks like a competitor of yours

IAmGraydon · 2025-01-14T04:37:37 1736829457

[flagged]

not_a_bot_4sho · 2025-01-14T05:32:00 1736832720

It was a thing back in the late 90s. I still do it in casual conversations with friends, less so in professional settings.

It's a gen X thing, like using "lol" to mean literal laughter

johnny22 · 2025-01-14T05:51:21 1736833881

it's been a thing at least on irc for at least 20 years. i've been used to it for a long time.

dovin · 2025-01-14T05:01:55 1736830915

I like to call it informal case.

furyofantares · 2025-01-14T05:42:50 1736833370

When I grew up online in the 90s, on IRC, AOL/AIM, ICQ and web forums, it was extremely common. Most of the people I know from then still do it, and I still do it with them and in many other places, although for whatever reason I don't do it here. Although it's 50/50 when on their phones now that phones auto-capitalize by default now.

urig · 2025-01-14T06:37:11 1736836631

Rules are put in place to be followed, for a reason. Capital letters at the start of the sentence increase readability. People who don't bother with them are being incosiderate towards their readers.

wahnfrieden · 2025-01-14T08:09:25 1736842165

not at all

pizza · 2025-01-14T05:33:16 1736832796

yes but there could be many possible reasons, for instance

- it's muuch faster on mobile

- you're aiming to convey litheness to potential target audiences who will know to recognize it as intentional litheness

- you've gotten used to minimizing the amount of keystrokes necessary for communicating things, to the point it's second nature

- you've worked a lot in the past with older nlp systems, where ignoring capitalization was a given for inputs anyhow, and just got used to treating it as syntactic cruft only strictly necessary in more formal settings ;)

cjbprime · 2025-01-14T06:46:53 1736837213

With the current default mobile keyboards, I'd guess it's slower, not faster.

stouset · 2025-01-14T07:08:16 1736838496

> it's muuch faster on mobile

This isn’t 1998. Mobile keyboards autocapitalize. You have to go out of your way to avoid capitalization on mobile.

Piisamirotta · 2025-01-14T06:46:24 1736837184

I have been thinking of this too. I find it super annoying to read and it looks unprofessional.

demarq · 2025-01-14T07:09:06 1736838546

how does this bother you, what greater meaning does it have?

IAmGraydon · 2025-01-16T00:27:05 1736987225

I tend to shy away from intentional illiteracy and laziness, both of which this is an example of. Not capitalizing does also affect readability. That said, I was honestly asking because I’ve seen it a few times on HN in the last couple of weeks and was curious if it’s coincidence or an actual trend.

Atotalnoob · 2025-01-14T04:43:20 1736829800

It’s a techbro thing.

Sama does it too

benatkin · 2025-01-14T05:02:35 1736830955

You're writing in rust-inspired English it seems, omitting the punctuation mark at the end of your second sentence so it gets returned.

compootr · 2025-01-14T05:07:31 1736831251

it's typesafe and efficient

brookst · 2025-01-14T05:42:49 1736833369

e. e. cummings, techbro. who knew?

mmaunder · 2025-01-14T05:29:44 1736832584

It’s a low effort flex. As in: you’re unimportant, this is unimportant, and I’m very busy, so I can’t or won’t bother to capitalize. Which is ironic because it’s more effort to not capitalize.

wahnfrieden · 2025-01-14T08:10:44 1736842244

it's many more keypresses, and using modifier keys is generally rsi-prone

maxbond · 2025-01-14T08:39:44 1736843984

Without commenting on this subthread (I don't have an opinion), you or anyone else with this concern should look into sticky modifiers (modifiers that apply to the next key press without being held). They were a game changer for me personally as far as managing RSI, as I had a bad habit of tilting my wrist to eg type a capital T.

wahnfrieden · 2025-01-17T01:06:36 1737075996

I use thumb keys (Glove80) which helps but I need to give that a try too

syndicatedjelly · 2025-01-14T03:31:57 1736825517

[flagged]

benatkin · 2025-01-14T04:06:03 1736827563

Yeah, I strongly disagree with the way it's characterized here.

TimTheTinker · 2025-01-14T04:17:32 1736828252

> we reached out to them after seeing this and they apologized.

How does this make it sound like they made Snyk apologize?

ArVID220u · on Sept 26, 2023

This looks really cool! I'm excited to see a production deployment of DiskANN.

According to the single-threaded QPS experiments, your DiskANN solution should clock in at about 4.5ms latency (1000ms/224QPS) whereas pgvector is about 5.8ms latency (1000ms/173QPS). How is that possible? My (very shallow) knowledge of DiskANN vs HNSW tells me that DiskANN should generally have higher latency than HNSW — DiskANN needs to touch the SSD while HNSW only touches RAM.

Also, compared to pgvector and HNSWPQ in faiss, how much less RAM does your DiskANN-based solution use?

avthar · on Sept 26, 2023

(Blog author here). Thanks for the question. In this case the index for both DiskANN and pgvector HNSW is small enough to fit in memory on the machine (8GB RAM), so there's no need to touch the SSD. We plan to test on a config where the index size is larger than memory (we couldn't this time due to limitations in ANN benchmarks [0], the tool we use).

To your question about RAM usage, we provide a graph of index size. When enabling PQ, our new index is 10x smaller than pgvector HNSW. We don't have numbers for HNSWPQ in FAISS yet.

[0]: https://github.com/erikbern/ann-benchmarks/

ArVID220u · on March 15, 2023

that's exactly what we're building with Control! https://twitter.com/sualehasif996/status/1635757961984221185

Krutonium · on March 15, 2023

Any chance me, a lowly OSS Developer and Twitch Streamer of the Small Variety, could acquire access early? I would love to try using it for things like writing Mods for Games and such.

sualehasif · on March 15, 2023

Feel free to email us, and we can make something work!

ArVID220u · on Aug 15, 2022

Paid monthly subscriptions.

ArVID220u · on Aug 15, 2022

Scaling is definitely why this hasn't been done before — it is hard! Recent research in private information retrieval shows that we can do it for 1 million people (see e.g. https://www.usenix.org/conference/osdi21/presentation/ahmad and https://www.cs.utexas.edu/~dwu4/spiral.html). All of the recent advances still scale as n^2 with the number of users, which becomes prohibitive after 1 million people, but we are working on a paper that shows how we can scale subquadratically (n^{1+1/k}*c^k for a fixed c and any k). With an n^1.33 scheme, 1 billion people suddenly becomes feasible.

ArVID220u · on Aug 15, 2022

With all respect, it is not obviously false. You should take a look at our whitepaper: https://anysphere.co/anysphere-whitepaper.pdf (the figure on page 2 might be helpful). To my knowledge, no other communication platform provides complete metadata privacy like we do. I would love to be disproved here though! It would be awesome if other completely private communication tools exist.

I do agree that there is a general problem where companies make hyperbolic claims, especially when it comes to security/privacy. For example, “zero trust” has been abused by so many people that even in the cases where it might apply, you cannot say that it applies to you because the term has lost its meaning. In our case, users do not need to place any trust in our servers, but we decided not to call it “zero trust” because people have taken it to mean “trust your employees less” or something else similarly outrageous.

If you have any suggestions for how we can improve our messaging here I’d love to take them. In other words, how could we convey that we are indeed the only completely private communication platform, without provoking a reaction similar to yours?

andrewmcwatters · on Aug 15, 2022

It is obviously false. End-to-end encryption doesn't leak metadata in the way you propose it does in your whitepaper. And it doesn't help that you don't define metadata in your paper, you just repeat it over and over again.

Specifically, what are you talking about protecting? Does this extend to deep packet inspection? Because your paper doesn't mention anything about that either. OR, you know, literally just talking to another server. You don't mention relays. You casually mention Tor in passing but make no concrete statements about the design of it by comparison.

Your paper isn't rigorous, your claims are superfluous, and they further attempt to discredit security progress across the entire field.

YOU are the only one who has ever created truly secure communication? Get real. What a complete joke.

It's like someone selling water and saying no one has ever created pure water before US.

Edit: If you want to appeal to security people, use plain language, be precise, and state your intentions. You do none of those things with this software.

Instead you:

* Use provably wrong marketing language

* Propose a provably wrong whitepaper

* Do not state your intentions for building the software

What it looks like to me is that you received some modest funding ($200,000) to write software and your sponsors didn't realize your work doesn't pass the smell test.

shengtongzhang · on Aug 15, 2022

Shengtong chiming in here. We are working on a rigorous security proof here https://anysphere.co/anysphere-security-definition.pdf. Included in it is a definition of metadata, a definition of exactly what we are defending against, as well as a rigorous proof of defense against adversaries that can manipulate packets. It is still work in progress, so there may be a lot of typos, but I believe it is a correct proof.

Let us know if there is anything else you want to be proved, or if the adversary in the paper is not strong enough :).

ArVID220u · on Aug 15, 2022

Good idea! I added a few to the linked Gist.