More

mkbosmans · 2026-05-04T06:38:13 1777876693

Nobody says they have no limitations. The question is are those limitation fundamental, i.e. can we expect improvement, say within a year.

danpalmer · 2026-05-04T06:59:52 1777877992

When I talk about fundamental limitations, I mean limitations that can't be solved, even if they could be improved.

We have improved hallucinations significantly, and yet it seems clear that they are inherent to the technology and so will always exist to some extent.

p-e-w · 2026-05-04T08:31:12 1777883472

“Seems clear” based on what?

pegasus · 2026-05-04T08:59:00 1777885140

For one, based on continuously frustrated hopes (and promises!) that hallucinations will go away.

coldtea · 2026-05-04T07:55:08 1777881308

As a general architecture, an LLM also has limitations that can't be improved unless we switch to another, fundamentally different AI design that's non LLM based.

There are also limitations due to maths and/or physics that aren't fixable under any design. Outside science fiction, there is no technology whose limitations are all fixable.

Here's one: https://arxiv.org/abs/2401.11817?utm_source=chatgpt.com

ToValueFunfetti · 2026-05-04T13:47:38 1777902458

Am I misreading that paper? They define hallucinations as anything other than the correct answer and prove that there are infinitely many questions an LLM can't answer correctly, but that's true of any architecture- there are infinitely many problems a team of geniuses with supercomputers can't answer. If an LLM can be made to reliably say "I don't know" when it doesn't, hallucinations are solved- they contend that this doesn't matter because you can keep drawing from your pile of infinite unanswerable questions and the LLM will either never answer or will make something up. Seems like a technically true result that isn't usefully true.

mkbosmans · 2026-03-15T09:59:25 1773568765

No need for radical changes.

  def visit_bf(g):
    n, children = g
    yield n
    if children:
        iterators = [iter(visit_df(c)) for c in children]
        while iterators:
            try:
                yield next(iterators[0])
            except StopIteration:
                iterators.pop(0)
            iterators = iterators[1:] + iterators[:1]

The difference between DFS and BFS is literally just the last line that rotates the list of child trees.

Python is a pretty mainstream language and even though the DFS case can be simplified by using `yield from` and BFS cannot, I consider that just to be syntactic sugar on top of this base implementation.

_dain_ · 2026-03-15T13:23:47 1773581027

Oh wow, I've never seen that "list of iterators" trick before. I always thought you needed an explicit queue for breadth-first.

Thanks!

mkbosmans · 2026-03-13T07:41:24 1773387684

Well, the article says that the effect of the impact was much larger than the scientists expected. That doesn't really give a lot of confidence in how good we are at predicting these things.

mkbosmans · 2026-03-12T08:14:39 1773303279

One of the ways that the classics can be improved is not to take the analytic ideal coefficients and approximate them to the closest floating point number, but rather take those ideal coefficients as a starting point for a search of slightly better ones.

The SLEEF Vectorized Math Library [1] does this and therefore can usually provide accuracy guarantees for the whole floating point range with a polynomial order lower than theory would predict.

Its asinf function [2] is accurate to 1 ULP for all single precision floats, and is similar to the `asin_cg` from the article, with the main difference the sqrt is done on the input of the polynomial instead of the output.

[1] https://sleef.org/ [2] https://github.com/shibatch/sleef/blob/master/src/libm/sleef...

mkbosmans · 2026-03-12T08:22:58 1773303778

I'm sorry, that second reference was actually for the 3.5ULP variant. The 1 ULP is here: https://github.com/shibatch/sleef/blob/master/src/libm/sleef...

mkbosmans · 2026-03-11T07:42:29 1773214949

That's not just a good prediction—it is literally already happening right now!

brookst · 2026-03-15T01:18:40 1773537520

Certainly!

mkbosmans · 2026-03-10T10:09:11 1773137351

Nothing on that list has been named that way by Euler himself of course.

mkbosmans · 2026-02-25T07:03:12 1772002992

That seems to be due to he microcontroller using its pins in duplex. There is indeed no radiation being emitted in that case, just the lamp and rotation.

https://news.ycombinator.com/item?id=46979936

mkbosmans · 2025-10-03T07:12:49 1759475569

Another two bytes found (I think)

  (d==0.?K*.01*h:c-c)

could become

  (d>0.?.0:.01)*K*h

ux · 2025-10-03T10:44:14 1759488254

Ah nice for noticing d!=0 is d>0. Not sure how I missed the multiplication to get rid of the vector form; I guess I was too obsessed with the x-x trick...

I added your changes to the Shadertoy version with your HN nickname. I'll integrate it to the original later.

Thanks!

mkbosmans · 2025-10-04T11:18:13 1759576693

I saw that you used `float z;` to later use `z` instead of the constant `0.`. You can also apply that to get a zero vector: `vec3 y;` and use `y` in place of `p-p`.

It seems that leaving the obsession behind some more can save another byte.

mkbosmans · 2025-09-03T06:25:26 1756880726

Especially in HPC there are lots of workloads that do not benefit from SMT. Such workloads are almost always bottlenecked on either memory bandwidth or vector execution ports. These are exactly the resources that are shared between the sibling threads.

So now you have a choice of either disabling SMT in the bios, or make sure the application correctly interprets the CPU topology and only spawns one thread per physical core. The former is often the easier option, both from software development and system administration perspective.

PunchyHamster · 2025-09-03T09:48:56 1756892936

HT cores can still run OS stuff in that case as that isn't really in contention with those. Tho I can see someone not wanting to bother with pinning

skeezyboy · 2025-09-03T10:15:12 1756894512

>Especially in HPC there are lots of workloads that do not benefit from SMT...So now you have a choice of either disabling SMT in the bios

Thats madness. Theyre cheaper than their all-core equivalent. Why even buy one in the first place if HT slows down the CPU? Youre still better off with them enabled.

mkbosmans · 2025-09-03T06:17:17 1756880237

Sort of niche indeed.

In addition to needing SMT to get full performance, there were a lot of other small details you needed to get right on Xeon Phi to get close to the advertised performance. Think of AVX512 and the HBM.

For practical applications, it never really delivered.