Good news is that a year in consulting isn't long enough to look like a detriment. Bad news is that the job market is crap right now.
> Is anyone aware of any firms that do formal-methods-like activities
Don't limit yourself to what you studied in your PhD. A big chunk of commercial research positions will be focused on AI.
> Has anyone got experience making a shift from a non-technical to a technical role?
Don't think of it as a shift from a non-technical to a technical role. Think of it as "I finished my CS PhD about a year ago and now I'm looking for a research/software job".
Yes, it is. And I say this as someone who explored significantly more than the surface. Even the way you “search” is different, as is the number of results and contrasting opinions you see.
It was not Helsing, sorry. The challenge was to build a zero-knowledge implementation from scratch as an auth/login server. I was able to get it working, but I spent a non-trivial amount of time on it, which is the real story of my job search lately. It's a part-time job just to keep up with code challenges while I'm applying this often for work.
I don't understand why Python gets shit for being a slow language when it's slow but no credit for being fast when it's fast just because "it's not really Python".
If I write Python and my code is fast, to me that sounds like Python is fast, I couldn't care less whether it's because the implementation is in another language or for some other reason.
Because for any nontrivial case you would expect Python + compiled library and associated marshaling of data to be slower than that library in its native implementation without any interop/marshaling required.
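To make the marshalling point concrete, here's a rough sketch (numbers are illustrative and machine-dependent; numpy is just a stand-in for "compiled library"):

    import timeit
    import numpy as np

    small = np.arange(10)
    large = np.arange(10_000_000)

    # For tiny inputs the cost of crossing the Python/C boundary dominates:
    # a pure-Python sum over ten ints is competitive with the numpy call.
    print(timeit.timeit(lambda: small.sum(), number=100_000))
    print(timeit.timeit(lambda: sum(range(10)), number=100_000))

    # For large inputs the compiled loop inside numpy wins by a wide margin.
    print(timeit.timeit(lambda: large.sum(), number=10))
    print(timeit.timeit(lambda: sum(range(10_000_000)), number=10))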
When you see an interpreted language faster than a compiled one, it's worth looking at why, because most of the time it's because there's some hidden issue causing the other to be slow (which could just be a different and much worse implementation).
Put another way, you can do a lot to make a Honda Civic very fast, but when you hear one goes up against a Ferrari and wins your first thoughts should be about what the test was, how the Civic was modified, and if the Ferrari had problems or the test wasn't to its strengths at all. If you just think "yeah, I love Civics, that's awesome" then you're not thinking critically enough about it.
Yep, and the next logical question when both implementations are for the most part bare metal (compiled and low-level) is: why is there a large difference? Is it a matter of implementation/algorithm, inefficiency, or a bug somewhere? In this case, that search turned up a hardware issue that should be addressed, which is why it's so useful to examine these things.
> Because for any nontrivial case you would expect Python + compiled library and associated marshaling of data to be slower than that library in its native implementation without any interop/marshaling required.
> When you see an interpreted language faster than a compiled one, it's worth looking at why, because most of the time it's because there's some hidden issue causing the other to be slow (which could just be a different and much worse implementation).
On the contrary, the compiled languages tend to only be faster in trivial benchmarks. In real-world systems the Python-based systems tend to be faster because they haven't had to spend so long twiddling which integers they're using and debugging crashes and memory leaks, and got to spend more time on the problem.
I don't doubt that can happen, but I'm also highly doubtful that it's the norm for large, established, mature projects with lots of attention, such as popular libraries and the standard library of popular languages. As time spent on the project increases, I suspect that any gain an interpreted language has over an (efficient) compiled one not only gets smaller, but eventually reverses in most cases.
So, like in most things, the details can sometimes matter quite a bit.
> I don't doubt that can happen, but I'm also highly doubtful that it's the norm for large, established, mature projects with lots of attention, such as popular libraries and the standard library of popular languages.
Code that has lots of attention is different, certainly, but it's also the exception rather than the rule; the last figure I saw was that 90% of code is internal business applications that are never even made publicly available in any form, much less subject to outside code review or contributions.
> As time spent on the project increases, I suspect that any gain an interpreted language has over an (efficient) compiled one not only gets smaller, but eventually reverses in most cases.
In terms of the limit of an efficient implementation (which something like Python is certainly nowhere near), I've seen it argued both ways; with something like K the argument is that a tiny interpreter that sits in L1 and takes its instructions in a very compact form ends up saving you more memory bandwidth (compared to what you'd have to compile those tiny interpreter instructions into if you wanted them to execute "directly") than it costs.
I think there's something to the idea of keeping the program in the instruction cache by deliberately executing parts of it via interpreted bytecode. There should be an optimum around zero instruction cache misses, either from keeping everything resident, or from deliberately paging instructions in and out as control flow in the program changes which parts are live.
There are complicated tradeoffs between code specialisation and size. Translating some of it back and forth between machine code and bytecode adds another dimension to that.
I fear it's either the domain of extremely specialised handwritten code - luajit's interpreter is the canonical example - or of the sufficiently smart compiler. In this case, a very smart compiler.
> On the contrary, the compiled languages tend to only be faster in trivial benchmarks. In real-world systems the Python-based systems tend to be faster because they haven't had to spend so long twiddling which integers they're using and debugging crashes and memory leaks, and got to spend more time on the problem.
This is an interesting premise.
Python in particular gets an absolute kicking for being slow. Hence all the libraries written in C or C++ then wrapped in a Python interface. Also why "Python was faster than Rust at anything" is headline-worthy.
I note your claim is that Python systems in general tend to be faster (outside of trivial benchmarks, whatever the scope of that is). Can you cite any single example where this is the case?
> Can you cite any single example where this is the case?
Plenty of line-of-business systems I've seen, but systems big enough to matter tend not to be public. Bitbucket's cloud and on-prem versions are the only case I can think of where you can directly compare something substantial between an implementation known to be written in Python and an implementation that's known to be written in C/C++ (and even then I'm not 100% sure that's what they use).
I wonder if it's because we're sometimes talking at cross purposes.
For me, coding is almost exclusively using Python libraries like numpy to call out to other languages like C or Fortran. To me it feels silly to say I'm not coding in Python.
On the other hand, if you're writing those libraries, coding to you is mostly writing Fortran and C optimizations. It probably feels silly to say you're coding in Python just because that's where your code is called from.
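To make that concrete, here's roughly what "coding in Python" looks like in that mode (a toy illustration; almost none of the time goes to Python bytecode, it's spent inside the BLAS/LAPACK routines numpy calls out to):

    import numpy as np

    a = np.random.rand(1000, 1000)
    b = np.random.rand(1000, 1000)

    c = a @ b                   # matrix multiply: dispatched to a compiled BLAS routine
    w = np.linalg.eigvals(a)    # eigenvalues: LAPACK does the heavy lifting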
There is a version of BASIC, a QuickBASIC clone called QB64, that is lightning fast because it transpiles to C++. By your reasoning, a programmer should think that BASIC is fast because he only does BASIC and does not care about the environment details?
It's actually the opposite: a Python programmer should know how to offload most of the work out of Python into C, or use the libraries that do so. He should not be oblivious to the fact that any decent Python performance comes from shrinking the ratio of actual Python instructions to native instructions.
I think maybe it's just semantics as long as everyone agrees where the speedup is happening (at the low level language calls).
I noticed that you're pretty hard in the "BASIC isn't fast, the thing it transpiles to is fast" camp, but still accidentally said "there is a version of BASIC [...] that is lightning fast", which I'm not sure you actually think? It highlights just how tricky it is to talk about where speed lives.
There is a clear distinction between the original language design (an interpreter) and a project aiming to recreate a sub-standard of that language and support its legacy codebase via a transpiler.
But you will care if that "Python" breaks - you get to drop down to C/C++ and debug native code. Likewise for adding features or understanding the implementation. Not to mention having to deal with native build tooling and platform-specific stuff.
It's completely fair to say that's not python because it isn't - any language out there can FFI to C and it has the same problems mentioned above.
It's pretty hard to draw this line in Python because all built-in types and functions are effectively C extensions, just compiled directly into the interpreter.
Conversely, you can have pure C code just using PyObjects (this is effectively what Cython does), with the Python bytecode interpreter completely out of the picture. But the perf improvement is nowhere near what people naively expect from compiled code, usually.
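You can see the first point directly from the interpreter (a small illustration; the exact output wording varies by CPython version):

    import inspect

    # len is implemented in C and compiled into the interpreter, so there is
    # no Python source behind it.
    print(type(len))               # <class 'builtin_function_or_method'>
    print(inspect.isbuiltin(len))  # True
    try:
        inspect.getsource(len)
    except TypeError:
        print("no Python source here: len lives in C")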
Yes, which is why I would argue that IO is a particularly bad benchmark here, since everything is just a thin layer on top of the actual syscall, and those layers don't do any real work worth comparing.
The only thing that makes sense to compare when talking about Python's performance is how many instructions it needs to compute something, versus the instructions needed to compute the same thing in C. Those are probably a few orders of magnitude apart.
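You can get a feel for that ratio with the dis module: even a one-liner expands into several bytecode operations, each of which goes through the interpreter loop and dynamic dispatch, where C would emit roughly a single add (a rough illustration, not an exact count):

    import dis

    def add(a, b):
        return a + b

    # Prints a handful of ops (LOAD_FAST, BINARY_ADD/BINARY_OP, RETURN_VALUE, ...),
    # each of them far more expensive than a native add instruction.
    dis.dis(add)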
Usually, yes, but when it's a bug in the hardware, it's not really that Python is fast, more like that CPython developers were lucky enough to not have the bug.
The PyObject header is a target for optimisation. Performance regressions are likely to be noticed, and if a different header layout is faster, then it's entirely possible that it will be used for purely empirical reasons. Trying different options and picking the best performing one is not luck, even if you can't explain why it's the best performing.
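For anyone wondering what that header amounts to in practice, it's easy to poke at (typical 64-bit CPython figures; exact numbers vary by version and build):

    import sys

    # Every CPython object starts with the PyObject header (refcount + type
    # pointer); richer objects add their payload on top of it.
    print(sys.getsizeof(object()))  # bare object: just the header, typically 16 bytes
    print(sys.getsizeof(1))         # small int: header plus the integer payload
    print(sys.getsizeof([]))        # empty list: header plus list bookkeeping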
You can expect the Python developers to look very closely at any benchmark that significantly benefits from adding random padding to the object header. Performance isn’t just trying a bunch of random things and picking whatever works the best, it’s critical to understand why so you know that the improvement is not a fluke. Especially since it is very easy to introduce bias and significantly perturb the results if you don’t understand what’s going on.
We're not talking about random changes. We're talking about paying attention to the measured performance of changes made for other reasons.
Just like in this article. The author measured, wondered, investigated, experimented, and finally, after a lot of hard work, made the C/Rust programs faster. You wouldn't call that luck, would you? If there had been a similar performance regression in CPython, then a benchmark could have picked up on it, and the CPython developers would then have done the same.
You can look at the history of PyObject yourself: https://github.com/python/cpython/commits/main/Include/objec.... None of these changes were done because of weird CPU errata that meant that making the header bigger was a performance win. That isn't to say that the developers wouldn't be interested in such effects, or be able to detect them, but the fact that the object header happens to be large enough to avoid the performance bug isn't because of careful testing but because that's what they ended up with for other reasons, far before Zen 3 was ever released. If it so happened that Python was affected because the offset needed to avoid a penalty was 0x50 or something then I am sure they would take it up with AMD rather than being content to increase the size of their header for no reason.
What you don't see in the logs are the experiments and branches that weren't pursued further because they didn't perform well enough.
Also: If you're going to prove that changes informed by performance measurements are absent from the commit logs, then you'll need to look in the logs for all the relevant places, which means also looking at I/O and bytes and allocator code.
Given that the performance is only affected by the size of that object header, the file I linked is all you'd need to see changes in. Look, the Python project is not picking their object sizes because it performs well on a quirk of Zen 3. End of story. I did performance work professionally in the past and now do it recreationally, and this specific instance is 100% luck. This is not because I think the runtime people aren't smart or anything, but this would be an insane thing to do on purpose.
I think the confusion comes from people not having a good understanding of what an interpreted programming language does, and what actual portion of time is spent in high versus low level code. I've always assumed that most of my programs amount to a bit of glue thrown in between system calls.
Also, when we talk about "faster" and "slower," it's often not clear what order of magnitude we're talking about.
Maybe an analysis of actual code execution would shed more light than a simplistic explanation that the Python interpreter is written in C. I don't think the BASIC interpreter in my first computer was written in BASIC.
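A low-effort way to do that kind of analysis on your own code (a sketch; swap the toy function for your actual entry point):

    import cProfile
    import pstats

    def work():
        # toy stand-in for a real program: mostly glue around C-level built-ins
        data = [str(i) for i in range(100_000)]
        return len(",".join(data))

    cProfile.runctx("work()", globals(), locals(), "profile.out")
    # The top entries show how much time lands in C-implemented built-ins and
    # syscalls versus pure-Python frames.
    pstats.Stats("profile.out").sort_stats("cumulative").print_stats(5)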
Agreed. The speed of a language is inversely proportional to the number of CPU instructions emitted to do something meaningful, e.g. solve a problem. Not whether it can target system calls without overhead and move memory around freely. That's a given.
>I don't understand why Python gets shit for being a slow language when it's slow but no credit for being fast when it's fast just because "it's not really Python".
What's there to understand? When it's fast it's not really Python, it's C. C is fast. Python can call out to C. You don't have to care that the implementation is in another language, but it is.
I constantly get low-key shade for choosing to build everything in Python. It’s really interesting to me. People can’t break out of thinking, “oh, you wrote a script for that?”. Actually, no, it’s software, not a script.
99% of my use cases are easily, maintainably solved with good, modern Python. The Python execution is almost never the bottleneck in my workflows. It’s disk or network I/O.
I’m not against building better languages and ecosystems, and compiled languages are clearly appropriate/required in many workflows, but the language parochialism gets old. I just want to build shit that works and get stuff done.
I keep queries in .sql files in a git repo. I run longer queries by writing them in a file and including/running it with \i. There's also \e to open $PSQL_EDITOR.
I find it funny how the Bryan Johnsons of the world take like 123 pills every day and optimize all the fun out of life, and we still don't really know if it works or not.
And then old geezers like Munger and Buffett do whatever the hell they want and outlive everybody.
Jack LaLanne died at 96 years old, yet my grandpa who's eaten tons of Taco Bell for most of his life is alive at just a hair from that age. I wonder what people will think about the obsession with "longevity" in the likely outcome of David Sinclair or Andrew Huberman dying in their 80s or earlier.
There's a tribe in Ecuador with reduced height, studied by Valter Longo and others. They have a mutation in their growth hormone receptor, which means they experience less effect from human growth hormone, hence the reduced height. And their lifestyle is rather unhealthy: alcohol, smoking, sugar, junk food, obesity, etc. Yet they rarely experience diabetes, cancer, etc., probably due to reduced mTOR pathway activation.
In the context of this specific post regarding Charlie Munger, you can't say that it's genetics unless you measure specific genes. He could be 100x Bryan Johnson, but Bryan Johnson at least makes his protocols open for use. And Munger, for all his curiosity, didn't even bother to get a genetic test, thus providing no essential value to human civilization.
Genetics. They matter a ton. Read the bios of really old people (100+ years old) and nothing really stands out beyond having a lot of family members who also lived a long time. I think a low-stress lifestyle helps a lot too.
Commenting only on their professional lives: their investment strategy was generally not a high-stress one, and they clearly enjoyed their work, so one could assume their professional lives were generally low stress.
They lived frugal lives, with everything they needed covered for probably 10,000 years ahead. Large families, plenty of friends, and a lot of wisdom. Yeah, I think Charlie and Warren's lives have way less stress than most people's.
This is a fair point. If you exclude the early part of his life, working as an asset manager is easy work. Most jobs are much more stressful. Basically, you sit around and read annual/quarterly reports and try to find the next company to buy. To be clear, this type of asset management is similar to private equity. They are buying whole companies. This business is much lower volatility than buying/selling stocks for fund management. Also: Fewer transactions. So I would say the 2nd half of his life was low stress (outsider's view, of course).
I genuinely think that this whole argument is a waste of time.
What matters is whether the outputs are useful and the outputs don't change based on whether you call it "thought", "AGI" or "probabilistic word selection".
Trees have been given rights in some places. Some people believe dogs and cats have rights.
Humans have not been given rights in some places. Some people believe some humans don't have rights.
It's not about "sentience" or "consciousness" -- in reality, these concepts are religious ones, like the soul, and don't map to anything objectively meaningful.
A better way to think about rights, I think, is that they stem from the barrel of a gun. Can a thing reciprocate a social contract with me usefully? Can it help me if I'm nice to it? Can it hurt me if I'm mean to it? Alternatively, will entities that can help me do so if they see me help this thing or harm me if they see me harm this thing? That's all that matters; I'll be game-theoretically forced to grant it personhood. I'm programmed by my brain to show empathy to you, and, when that fails, you or others will harm me if I hurt you. Or, if I become a powerful dictator, I might execute you for being inconvenient to me. For all I know, you're a p-zombie that I could otherwise hurt with a clean conscience. None of that really matters; it's all about what I'm forced to do, from within or without.
Your ethics are leading you to believe this. For other people it doesn’t make sense at all that a computer program should have rights. It makes even less sense to people who know what a computer program is.
We don't give single cells rights, but we do give the complex organization of cells rights. Why should binary code never be given the same consideration as dna code? What defining factor is at play here?
To think that it is "just a program" could be like saying we are just machines without self-determination. This view may well come to be seen as "racist" or "bigoted".
Individual ethics will determine the societal ethics that get codified into law. I have a hard time seeing how giving rights to intelligent enough machines won't happen, based on our existing ethics, laws, and the history therein.
> It makes even less sense to people who know what a computer program is.
I write code professionally and my beliefs are not what you claim them to be. Perhaps your opinion is the minority opinion? You should certainly not be claiming it as the de facto belief among programmers.
Severance (the TV show) is a pretty entertaining exploration of this issue.
Still... maybe it's not a good analogy. LLMs are infinitely replicable and editable. The "conscious experience," if you will, is discontinuous even if you assume the architecture will advance massively. We definitely don't need to be talking about rights yet.
Rights are part of ethics, which we most certainly need to be talking about.
As I said, what we have today is not deserving of such considerations imho, but I do expect to see someone trying to marry an AI before I die, so this will become an issue not too far off.
(in fact, someone already married an AI in Japan, and then the company that ran it closed, iirc)
Seems like a very bad reason to switch. Data engineering is different (and much worse than SWE in my opinion), and it's not like you're certain that you can avoid LC interviews if you try to switch.