> I call it program synthesis. In ChatGPT you can get ChatGPT to generate code, ...

samsquire · on June 20, 2023

Thank you for your reply and thank you for links to 3 (!) whitepapers.

What do you think is a good behavioural specification? I have often thought that: given a log (read: a highly accurate trace of what the program did) of a program, the log indicates what the program does and there are relations between the fields in the log and log lines.

If some program generates the same log, with the same input and output, is the behaviour of that program identical?

Now I want to do these things:

* provide example logs, which are desired behaviour and let the computer work out the code to fulfil that example

* combine the behaviours of one or more programs

* convert log into a tree or graph that resembles invocation stack (for functional application synthesis, such as "this log resembles a post order traversal" or "a normal form")

* tweak the behaviour of one program with the behaviour of another program, "use one program as a tool in the other program"

Could we wire up the logs and cause things to the code that generated them? The log is a bidirectional view into the program's operations and code that generated it.

In other words, modify code and behaviour by modifying behaviours directly and rely on causality feeding backwards through a chain of logic.

abeppu · on June 20, 2023

I think different contexts call for different kinds of specification, but most commonly, I do think "synthesize a _function_ which for inputs x1, x2, ..., xk produces outputs y1, y2, ..., yk respectively", is a pretty good setup provided you then are willing to test it on xk+1 ... xn. The "programming-by-example" research direction aligned nicely with TDD, and functions provide a nice abstraction layer.

I _don't_ think a detailed program trace is the best "specification" in most cases because constructing that log includes making a lot of choices of _how_ the program arrives at its outputs. The full trace for meaningful programs might be quite large, and onerous to specify (or you'd just produce one from an already-working program, in which case what's the point?).

For me, the benefit of synthesis should be that the programmer can describe what should be done, rather than how. However, this can quickly lead to a complex "specification language" which can be just as burdensome to write in as the desired target language, which is why examples are appealing. But perhaps some combination, where we provide some examples and also some formally specified restrictions ("the `get_work_history` method returns `jobs: List[Jobs]` such that `map(_.start_date, jobs)` is non-decreasing according to the default comparator on Dates ...") is best, since examples will generally underspecify the program.

Update: the 'different contexts' I think is mostly that sometimes, some specific attributes of 'how' the synthesized program accomplishes its goals do matter -- e.g. you may want to synthesize some mathematical optimization code which really ought to use the GPU, and that isn't indicated in just input/output examples, or you may want to ensure that part of an embedded system uses constant memory and returns after a constant number of steps.