Though, since I specifically mentioned agentic, I wanted to exclude non-agentic tools like prompt builders and context managers that you linked. :)
Reason being: my idea of agents is to generalize well enough, so the need for workflow based apps isn't needed anymore.
During discovery and planning phase, the agents should traverse the code base with a retrieval strategy packaged as a tool (embedded search, code-graphs, ...) and then add that new knowledge to the plan before executing the code changes.
Heading there? Facebook has been a kingmaker for a decade. Musk runs DOGE. Most big companies can bully smaller administrations when they feel the need.
It's kind of interesting if you view this as part of RLHF:
By processing the system prompt in the model and collecting model responses as well as user signals, Anthropic can then use the collected data to perform RLHF to actually "internalize" the system prompt (behaviour) within the model without the need of explicitly specifying it in the future.
Overtime as the model gets better at following its "internal system prompt" embedded in the weights/activation space, we can reduce the amount of explicit system prompts.
With 16x Eval, you can manage your prompts, contexts, and models in one place, locally on your machine, and test out different combinations and use cases with a few clicks.
Also missing a class of non-IDE desktop apps like 16x Prompt and Repo Prompt.
reply