More

mkeeter · 2026-04-30T17:27:04 1777570024

A repository search shows 2.2K repos with the text "A Mini Shai-Hulud has Appeared", all created within the past day:

https://github.com/search?q=A%20Mini%20Shai-Hulud%20has%20Ap...

rhdunn · 2026-04-30T17:35:30 1777570530

The repository names all look like two terms/words from dune (harkonen, mentat, ornithoptor, etc.) followed by a number. This would indicate that the account (possibly GitHub auth/actions token) has been compromised and then used to create the repository.

avaer · 2026-04-30T21:40:25 1777585225

Why can't GitHub get on the case and just block any repo where the README matches the regex? I thought they'd have learned their lesson the last time it happened.

This malware isn't even trying. Then again it's Microsoft so they're not even trying either.

eddythompson80 · 2026-04-30T22:15:54 1777587354

6 minutes later an HN submission "GitHub blocks your account if you mention X in the README" with a top comment "This is absurd, are they just doing regex matching to check for malware?"

bbor · 2026-05-01T04:34:37 1777610077

1. This happened less than 24 hours ago.

2. This is just one of the four techniques the worm uses to phone home.

sgskinner · 2026-05-01T01:18:33 1777598313

“Some people, when confronted with a problem, think ‘I know, I’ll use regular expressions.’ Now they have two problems.”

ramon156 · 2026-05-01T09:41:57 1777628517

https://github.com/tinin46

this account seems to store a lot of keys, not sure what theyre for

spate141 · 2026-04-30T17:28:10 1777570090

what's this all about?

progbits · 2026-04-30T17:31:15 1777570275

Malware uploading the credentials it managed to steal

foo12bar · 2026-04-30T17:35:24 1777570524

FTFA

> The attack steals credentials, authentication tokens, environment variables, and cloud secrets, while also attempting to poison GitHub repositories.

CodeAndCuffs · 2026-04-30T17:40:29 1777570829

That doesn't really explain why there is a bunch of GitHub repos created as well.

If I remember correctly from Shai-Hulud 2, the attacker extricated creds by posting them in public github repos with minor easily reversible encryption. I believe it was double b64 last time.

I'm assuming the logic there is that every security researcher and company is going to pull and scan those creds for their stuff and their clients' stuff. So the attacker is just 1 of N people downloading it. As opposed to trying to send it to their own machine directly.

arsome · 2026-04-30T18:04:30 1777572270

I think it's more about convenience and bypassing filters - developers are already logged in to github, already have access to create repos and publish code, firewalls will allow it. Even fancy HIDS systems will think the git push is rather normal.

If they have a clue, the attacker still will not download that without using a botnet tunnel or Tor at a minimum.

Note though that these credentials aren't even encrypted using some lightweight ECC to prevent others from capturing them, they're posted in cleartext. Embarassment might be part of the point.

bbor · 2026-05-01T04:37:01 1777610221

With HN ettiquette in mind, I must make an exception: this is a case where skimming the first parts of the article would help a lot!

The public repo path is just one of four parallel paths, with the goal of getting around any barriers:

  The exfiltration component shares its design with the "Mini Shai-Hulud" mechanism from their last campaign, using four parallel channels so stolen data gets out even if individual paths are blocked.

mkeeter · 2026-04-09T15:14:51 1775747691

Cryptic Titles Are All You Need

mkeeter · 2026-04-06T19:24:25 1775503465

The review is also heavily LLM-inflected, to the point of being distracting.

GPTZero gives it a 100% chance of being AI generated, and I've found that these tools may give false negatives from a well-prompted model, but false positives are rare.

If you are looking to tune your intuition for AI-written text, here's an interesting list of their quirks (ironically provided as a Claude skill for removing those quirks from emitted text):

https://github.com/stephenturner/skill-deslop/blob/main/refe...

geoffschmidt · 2026-04-06T20:48:25 1775508505

I'm not so sure about false positives being rare.. ZeroGPT flags the Gettysburg Address as 96% AI generated:

https://www.reddit.com/r/ArtificialInteligence/comments/1s0y...

(I tried it just now and got the same result as in that post)

jasomill · 2026-04-06T23:53:14 1775519594

According to that site, Robert Kennedy's speech on the night Martin Luther King was killed[1] was almost entirely the product of GenAI, as were both of Obama's inaugural addresses[1][2].

By this logic, I'd venture a guess that "AI" was also responsible for some of Shakespeare's most famous lines.

[1] https://www.jfklibrary.org/learn/about-jfk/the-kennedy-famil...

[2] https://obamawhitehouse.archives.gov/realitycheck/the_press_...

[3] https://obamawhitehouse.archives.gov/the-press-office/2013/0...

mkeeter · 2026-04-06T20:54:18 1775508858

Fair enough, I accept "the blog post was written by someone from the 1800s" as an alternative hypothesis.

edit: For what it's worth, I also just tested the Gettysburg Address (using the "Bliss Copy" from [1]), and got a "100% Human" score.

[1] https://www.abrahamlincolnonline.org/lincoln/speeches/gettys...

mkeeter · 2026-03-27T19:10:46 1774638646

The "Intermediate Report" [1] lists the authors as "Robert V. and Claude (Anthropic)". Is there any reason to believe this is not AI hallucinations?

[1] https://stateofutopia.com/papers/2/intermediate-report.pdf

pseudohadamard · 2026-03-28T03:32:04 1774668724

Almost certainly. Someone no-one has ever heard of before driving a hallucinating AI claims to have done what the world's best cryptographers have been unable to do. Just wait a day or two for the first crypto person who notices to pick the claim to pieces.

logicallee · 2026-03-28T11:57:39 1774699059

>Just wait a day or two for the first crypto person who notices to pick the claim to pieces.

we went to cryptographic experts first and published second, after they said it is a very good result and worth publishing. We've given a lot of help for reproducibility, the c and python programs encode the claims very precisely and anyone can verify the claims in ten minutes. The bottom line is that you wouldn't have seen this article if cryptographers hadn't seen these results first and liked them.

Freak_NL · 2026-03-28T14:00:25 1774706425

We? Is this the royal we, your highness? You are just one person right?

66yatman · 2026-04-04T21:03:23 1775336603

Their/they

dolmen · 2026-03-30T05:00:53 1774846853

None.

logicallee · 2026-03-27T19:50:41 1774641041

[flagged]

Retr0id · 2026-03-27T20:09:15 1774642155

If you can't tell the difference between MD5 and SHA-256, you should not be making claims such as the one in the title.

logicallee · 2026-03-27T20:37:43 1774643863

edited to clarify, thanks for pointing it out. It wouldn't be responsible for us to only publish when we got to the same stage for SHA-256, since at that point TLS and other certificates would be considered compromised.

seba_dos1 · 2026-03-27T21:36:48 1774647408

> Great question, and you're right to be skeptical.

Hi Claude! You're absolutely right!

thiht · 2026-03-27T22:10:53 1774649453

Got the same vibe from reading that sentence, reading AI replies on HN is so annoying…

kstrauser · 2026-03-27T19:56:27 1774641387

> You can use literally any MD5 tool

> Our certificates implement the full SHA-256 algorithm

We knew MD5 is broken. Do you have a POC for breaking SHA-256, too?

mkeeter · 2026-03-24T15:54:07 1774367647

They certainly have ambitions – the most recent changelog claims to add "Full PCB design pipeline: schematic capture, routing, DRC, Gerber export, and signal integrity simulation."

It also seems to have a physics engine, a slicer for 3D printing, an embroidery mode, and a entire ecosystem of math crates (https://tang.toys/).

Whether any of that works – or whether it's pure LLM slop – is less clear. I tried to import a trivial STEP file, and it crashed my browser tab [1]. Every commit is co-authored by Claude.

[1] https://github.com/ecto/vcad/issues/7

ecto · 2026-03-24T17:02:43 1774371763

Thanks for the bug report! I'll have some time later this week to look into it. Just had a baby :)

mkeeter · 2026-03-24T17:54:58 1774374898

Congrats!

(We had one back in December; you’re in for a fun ride!)

dr_win · 2026-03-24T16:43:40 1774370620

...and don’t forget Loon Lang — it’s a gem: https://loonlang.com

By the way, “they” is actually just one person: Cam Pedersen — https://campedersen.com

So far, he’s shown incredible productivity (with Claude Code). I integrated his vcad into my toy project here, and it worked on the first try, which is quite impressive for such a young project: https://github.com/darwin/supex/tree/dev

Definitely keep an eye on him.

ecto · 2026-03-24T17:02:52 1774371772

mkeeter · 2026-03-19T23:13:50 1773962030

Not a typo, but you’re correct about the sample rate - with those settings, the scope was doing interpolation between samples.

nomel · 2026-03-20T00:15:06 1773965706

By definition, you can't interpolate a sample. A sample is a measured value.

What you can do, if and only if you have an exactly repeating signal triggering at the same point within a cycle, is change the delay between the trigger and sample, and repeat. In other words, sample at different times within the same signal (since it's exactly repeating), to build up samples in time, of that waveform, to whatever time resolution you want.

Of course, you're limited to any noise in the trigger, variation in the signal, etc.

This is how you can record light moving through your garage [1]!

[1] https://www.youtube.com/watch?v=o4TdHrMi6do

jacquesm · 2026-03-20T00:26:23 1773966383

Not sure if mkeeter's comment has been ninja edited but it says between samples, it doesn't say it is interpolating to generate new samples.

nomel · 2026-03-20T01:00:04 1773968404

I understand, but that's my point, it's not interpolated!

The number he's referring to is in units of samples per second. It's not doing interpolation between samples, to achieve a high samples per second, because that's not possible, which is my point. Interpolation results in an imagined value, but samples are measured values.

It would be correct to say that the values between samples are interpolated, but the subject of interpolation isn't applicable for anything mentioned in this comment chain.

jacquesm · 2026-03-20T01:02:30 1773968550

Ah you are referring to the 'sps' bit. Ok, but I think the extra sentence is enough clarification of what they mean, even if they're wrong about what the device is doing.

The only time these are interpolating is when they are visualizing, there is no point (hah) in storing interpolated data, you can generate that whenever you want.

blharr · 2026-03-20T07:19:09 1773991149

Not the original reply, but I support the correction here. Regardless of how pedantic/nitpicking it seems, I remember getting confused about this a lot when learning digital signal processing. Simply because its really easy to upsample.. or look at an upsampled result and get confused by that

jacquesm · 2026-03-20T09:59:11 1774000751

I think 'upsample' is the root cause here. Technically that is a misnomer.

mkeeter · 2026-02-22T18:08:44 1771783724

More evidence: the user posted three well-formatted multi-sentence comments within 15 seconds.

https://news.ycombinator.com/item?id=47110801 (13:23:08)

https://news.ycombinator.com/item?id=47110803 (13:23:15)

https://news.ycombinator.com/item?id=47110804 (13:23:23)

mkeeter · 2026-02-08T01:03:00 1770512580

I have received 168 support ticket emails in the past 30 mins, and Gmail has not yet learned to flag them as spam.

This is an absolute clown show.

Edit: whoops, this was incorrect! I had received over 1,000 Zendesk emails, 168 of which made it into my inbox.

mkeeter · 2026-02-04T03:15:29 1770174929

Which NRTL did you end up using for certifications? Can you say more about that process?

mkeeter · 2026-01-28T13:01:47 1769605307

oh hi ChatGPT

The giveaway is that LLMs love bulleted lists with a bolded attention-grabbing phrase to start each line. Copy-pasting directly to HN has stripped the bold formatting and bullets from the list, so the attention-grabbing phrase is fused into the next sentence, e.g. “Potential for abuse Attestation enables blacklisting”

ingohelpinger · 2026-01-28T19:15:40 1769627740

Calling this a "giveaway" is kind of hilarious. LLMs use bulleted lists because humans have always used bulleted lists—in RFCs, design docs, and literally every tech write-up ever. Structure didn't suddenly become artificial in 2023. lol.

WD-42 · 2026-01-28T20:10:05 1769631005

Yea but humans would have fixed it, this person didn't even bother. Straight copy and paste.