GrayShade's comments

1.82.7 doesn't have litellm_init.pth in the archive. You can download it from PyPI to check.

EDIT: no, it's compromised, see proxy/proxy_server.py.
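
To make "download it from PyPI to check" concrete, here's a rough sketch. The PyPI JSON API and sdist layout are standard, but I'm assuming the 1.82.7 release is still downloadable, and the "suspicious" markers at the end are only illustrative, not a real detection rule:

    # Sketch: pull the litellm 1.82.7 sdist from PyPI and look at
    # litellm/proxy/proxy_server.py for anything that would run at import time.
    # We only read the source text; nothing from the package is imported.
    import io
    import json
    import tarfile
    import urllib.request

    meta = json.load(urllib.request.urlopen(
        "https://pypi.org/pypi/litellm/1.82.7/json"))
    sdist_url = next(f["url"] for f in meta["urls"]
                     if f["packagetype"] == "sdist")

    data = urllib.request.urlopen(sdist_url).read()
    with tarfile.open(fileobj=io.BytesIO(data), mode="r:gz") as tar:
        member = next(m for m in tar.getmembers()
                      if m.name.endswith("litellm/proxy/proxy_server.py"))
        source = tar.extractfile(member).read().decode("utf-8", "replace")

    # Example markers only; eyeball the file yourself rather than trusting this list.
    for marker in ("exec(", "eval(", "base64", "urlopen", "requests.post"):
        if marker in source:
            print("found:", marker)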


1.82.7 has the payload in `litellm/proxy/proxy_server.py` which executes on import.


I just woke up this morning and I am amazed. I'm taking all my nasty words back: I starred the project and followed the author, who reacted so fast to my dull negative feedback. That reaction shows how much he cares about the project.

Thanks for pointing this out; it seems like quite a good response to me.

I wouldn't mind opt-in telemetry, but the participation rate would probably be too low to make it useful.


My issue with telemetry is that 99% of software ends up not using it, so why have it? And definitely don't have it on by default. Your users will come and tell you what they want, which makes telemetry redundant, especially when it's an OSS project you're mostly building for yourself.

Except that telemetry can give you more complete (and foolproof) information than what users report. But yeah, that could also be solved with debug info that users can attach to their report; the app doesn't have to "call home" for that...
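
By "debug info" I mean something like the sketch below: a local dump the user can attach to a bug report instead of the app calling home. Everything here (file name, fields, version string) is made up for illustration:

    # Hypothetical sketch: write a local diagnostics file the user can attach
    # to a bug report. No network access involved.
    import json
    import platform
    import sys
    from pathlib import Path

    def write_debug_bundle(path: str = "debug-info.json") -> Path:
        info = {
            "app_version": "1.2.3",          # placeholder
            "python": sys.version,
            "platform": platform.platform(),
            "argv": sys.argv,
        }
        out = Path(path)
        out.write_text(json.dumps(info, indent=2))
        return out

    if __name__ == "__main__":
        print("wrote", write_debug_bundle())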

I agree, but it's a cost/benefit thing. Most OSS projects aren't big enough to do anything with the telemetry, so you're just paying in goodwill for no reason.

Opt-in via extension, fine. Opt-in via flag, unreliable. The spyware code should never be anywhere near the main codebase.

Yay! Also, I hadn't noticed an entire section about building from source. Sorry about that. Good work!

Woo, good on them

Don't forget location.

It doesn't, and it's optional.

That's not a shell, it's a Python interpreter compiled to WASM and running in the browser.

Whatever you call it, it plainly makes repeated calls to the server once it's loaded. You couldn't just throw it on GitHub or Cloudflare as-is.

This feels a bit pessimistic. Qwen 3.5 35B-A3B runs at 38 t/s tg with llama.cpp (mmap enabled) on my Radeon 6800 XT.

At what quantization and with what size context window?

Looks like it's a bit slower today. Running llama.cpp b8192 with the Vulkan backend.

$ ./llama-cli -m unsloth_Qwen3.5-35B-A3B-GGUF_Qwen3.5-35B-A3B-UD-Q4_K_XL.gguf -c 65536 -p "Hello"

[snip 73 lines]

[ Prompt: 86.6 t/s | Generation: 34.8 t/s ]

$ ./llama-cli -m unsloth_Qwen3.5-35B-A3B-GGUF_Qwen3.5-35B-A3B-UD-Q4_K_XL.gguf -c 262144 -p "Hello"

[snip 128 lines]

[ Prompt: 78.3 t/s | Generation: 30.9 t/s ]

I suspect the ROCm build will be faster, but it doesn't work out of the box for me.
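
As a very rough sanity check on why I think there's headroom: every number below is an assumption on my part (~3B active parameters from the "A3B" naming, ~0.56 bytes/weight for a Q4_K-style quant, ~512 GB/s peak bandwidth for a 6800 XT), and it ignores the KV cache entirely:

    # Back-of-envelope estimate, not a benchmark: the effective memory
    # bandwidth implied by the generation speeds above, vs. the card's peak.
    ACTIVE_PARAMS = 3e9        # assumed from the "A3B" naming
    BYTES_PER_WEIGHT = 0.56    # rough figure for a Q4_K-style quant
    PEAK_GBPS = 512            # nominal 6800 XT memory bandwidth

    def effective_bandwidth(tokens_per_s: float) -> float:
        """GB/s of weight reads implied by a decode speed (KV cache ignored)."""
        return tokens_per_s * ACTIVE_PARAMS * BYTES_PER_WEIGHT / 1e9

    for tps in (34.8, 30.9):
        gbps = effective_bandwidth(tps)
        print(f"{tps} t/s -> ~{gbps:.0f} GB/s ({gbps / PEAK_GBPS:.0%} of peak)")

That comes out around 50-60 GB/s, roughly a tenth of peak, which is why I'd expect a working ROCm build to do better.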


And footnote 3 is unreferenced.


Good catch, thank you both -- fixed!


It did; there are two incompatible approaches, zram and zswap.
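
If you want to check which one a given system is actually using, something like this works on typical modern kernels (a sketch, not a robust detector; the paths are the usual sysfs/proc locations):

    # Quick check for which swap-compression approach is in use.
    from pathlib import Path

    zswap = Path("/sys/module/zswap/parameters/enabled")
    if zswap.exists():
        print("zswap enabled:", zswap.read_text().strip())   # 'Y' or 'N'

    swaps = Path("/proc/swaps").read_text()
    print("zram swap device present:", "/dev/zram" in swaps)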


The Signal device linking feature is just as fast. It's partly a trick -- it will look for QR codes even outside the central area, so under good conditions it can get a read before you even get a rough orientation.


Maybe that's just the only API-visible change, which says nothing about the actual capabilities of the model?

