OpenDevin – open-source implementation of devin

wokwokwok · 2024-03-24T07:27:52 1711265272

There are two parts to “writing a Devin”.

1) a pretty user interface, a backend service layer, all the normal things you’d need for a desktop app or saas.

2) a backend that can actually do work.

I’m going to go out a limb here and say, building an MVP of the UI is a waste of your time.

…because, unless you know how to build the backend (ie. part 2), you actually, have no idea what you need to build for part one.

You can copy of the UI from the Devin videos.

You can build your own langchain framework.

You can fine tune an open model on GitHub issues.

…but just like having building a gpt4 is harder than just “add more params”, building something that works like Devin appears to work requires a reasonably sharp “step up” in capability between what literally everyone has been doing with gpt4 until now, and being able to turn that into a useful framework for solving actual engineering tasks.

So… don’t hold your breath. If you see someone building a UI (like this https://github.com/OpenDevin/OpenDevin/tree/main/frontend, https://github.com/stitionai/devika/tree/main/ui; just read the commit log, it’s basically just ui) it means they’re doing the easy work (part 1) because they don’t know how to do the hard part (part 2).

…so, interesting, but this doesn’t smell like a really serious effort (at least yet).

I guess you could argue that it’s important “setup infrastructure” stuff that any project starts with… but I’m just sceptical.

I can draw pictures of a Devin too. A serious effort would be trying to replicate what Devin does not what it looks like.

mellosouls · 2024-03-24T10:36:38 1711276598

So… don’t hold your breath. If you see someone building a UI (like this https://github.com/OpenDevin/OpenDevin/tree/main/frontend, https://github.com/stitionai/devika/tree/main/ui; just read the commit log, it’s basically just ui

Yes. If you link to the UI sections of a repo you will likely see "basically just ui" commit history.

I guess you could argue that it’s important “setup infrastructure” stuff that any project starts with… but I’m just sceptical.

Of course it's an early stage pilot, it's only been developed since the Devin hype, why should it compete in quality at this very early time?

moffkalast · 2024-03-24T10:44:16 1711277056

Well I don't see why not set up the UI part first if that's what currently possible. There's no shortage of LLM frontends right now but practically all of them are very meh (if I see another overused default theme Gradio app I will stab my eyes out).

When Zuck releases Llama 3 in July or whenever we might have something to plug this into.

CGamesPlay · 2024-03-24T12:05:42 1711281942

One reason why: they spent their launch hype on vaporware. If they had done the hard part and then launched with a default theme Gradio app around that, it would be much better for gathering excited developers.

rbren · 2024-03-27T03:32:25 1711510345

OpenDevin dev here. I created the default agent.

We're very aware that we'll need great agents to be able to compete with Devin and others. We're currently setting up evaluation pipelines to evaluate various agents against SWE-bench.

Our thesis is that a community experimenting with various agents and agent architectures will outpace a private company on a single track. We're building the notion of an "agent hub" out of the gates--anyone can plug into the Agent interface and contribute their work. We're also discussing how to build a meta-agent, which farms out specific tasks to sub-agents.

It's early days though--we've only just gotten things wired together in a sort-of working demo. Stay tuned!

NayamAmarshe · 2024-03-24T09:02:07 1711270927

Isn't Devin a GPT-4 wrapper too, with an increased context size?

Garlef · 2024-03-24T12:05:03 1711281903

Isn't a web browser just a wrapper around http requests, with some memoization and rendering?

NayamAmarshe · 2024-03-25T11:18:43 1711365523

Completely different things with different levels of complexities. The LLM is the one doing the heavy-lifting in the case of an agent. One does not need to build an LLM from scratch, but they do with a web-browser. With the LLM, the work is already 70% done, the rest is figuring out how to create an agent and fine-tune the params.

jokethrowaway · 2024-03-24T08:47:55 1711270075

I don't get what's special about Devin. I noticed the same logic issues I've seen with ChatGPT or Copilot or Gemini.

OpenAI already did spend quite a bit on software development reinforcement learning and we already have models with huge context windows.

We've already seen OSS projects where ai agents collaborate to create code projects.

If you use CodeLlama, throw some engineering manpower to create feedback loop with compilers, linters and other tooling to improve the code generated you can probably match Devin's performance. I can get some pretty good results feeding back compiler's output to ChatGPT, if that were automated, my impression of ChatGPT would be much better.

So my bet is that an open source Devin is absolutely possible.

wokwokwok · 2024-03-24T09:07:07 1711271227

> I don't get what's special about Devin.

Did you watch the YouTube videos?

I’m also skeptical it can do all the things it appears to be able to do, but I think it’s undeniable that what they show it doing (totally unverified as it is)…

…is significantly more capable than any other programming agent I’ve seen.

If openAI had that capability, they would be selling it.

They are not. I doubt very much that they’re just hoarding it and keeping it to themselves; gpt4 just can’t do it out of the box.

It is not that simple.

It is not just a fancy prompt for gpt4.

If it was that easy, someone would have done a year ago.

> If you use CodeLlama

Oh come on. These pathetic little open source models are shit compared the vendor offerings.

This is daydreaming about what might be, not what actually exists.

mellosouls · 2024-03-24T10:28:05 1711276085

If it was that easy, someone would have done a year ago

Devin isn't as original as it's marketing claims, there are prior tools in this space (eg. Sweep). I've no idea whether it's significantly better than them, but certainly the magic is the underlying LLM, not Devin.

No reason at all why other devs - including open source - can't continue to provide competitors at that same surface level.

paradite · 2024-03-24T10:41:22 1711276882

Yes. I can't believe the amount of negative comments on Devin or just brushing it off.

It is hard to build something like that.

If it's just a GPT wrapper, we would have tons of indie hackers building competitors and making 10k MRR by now.

The agents, prompts and orchestration are difficult problems to solve.

On the other hand, you have people thinking it's impossible to build something like that. I also disagree.

From a complexity perspective, coding a piece of software is no harder than driving an autonomous vehicle, since the logic is fixed and the environment is deterministic. If we can get to L5 for AV, I don't see why we can't get to L5 for coding.

gerroo · 2024-03-25T17:34:04 1711388044

I wholeheartedly agree. I'm trying to get involved with this project and thought that it was an effort in the right direction. Appreciate the insights.

If they let me I'll get to work on the "hard part."

molticrystal · 2024-03-24T06:43:29 1711262609

I saw that there is another project announced around the same time with the same goal, and has a discord as well. Maybe if they align enough they could collaborate.

>Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.

https://github.com/stitionai/devika

zmmmmm · 2024-03-24T07:52:53 1711266773

We really do spend half our time chiding everyone for anthropomorphising the AI and then the other half giving them human names and trying to make them explicitly emulate human behaviours that they aren't intrinsically good at.

karolist · 2024-03-24T09:08:38 1711271318

There's nothing really there yet besides the readme file. I saw this the other day on GitHub trending and left with mixed feelings. You can launch an empty repo riding the clicks and sharing what essentialy is a parked domain template, as long as the keywords match the narrative.

ramon156 · 2024-03-24T09:54:22 1711274062

I can't really expect Devin to be remade overnight, but I have my doubts too

habitue · 2024-03-24T06:25:33 1711261533

Gotta say, I admire the chutzpah. Commoditize the commoditization software, it's the most open source thing ever

meek4242 · 2024-03-27T18:21:41 1711563701

The UI is throwing an error relating to not being able to pull the image for: ghcr.io/opendevin/sandbox:v0.1

(base) OpenDevin % docker login ghcr.io Authenticating with existing credentials... Login Succeeded (base)OpenDevin % docker pull ghcr.io/opendevin/sandbox:v0.1 Error response from daemon: denied

error: File "/Users/meek/anaconda3/lib/python3.11/site-packages/docker/models/containers.py", line 876, in run self.client.images.pull(image, platform=platform) File "/Users/meek/anaconda3/lib/python3.11/site-packages/docker/models/images.py", line 464, in pull pull_log = self.client.api.pull( ^^^^^^^^^^^^^^^^^^^^^ File "/Users/meek/anaconda3/lib/python3.11/site-packages/docker/api/image.py", line 429, in pull self._raise_for_status(response) File "/Users/meek/anaconda3/lib/python3.11/site-packages/docker/api/client.py", line 267, in _raise_for_status raise create_api_error_from_http_exception(e) from e ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/Users/meek/anaconda3/lib/python3.11/site-packages/docker/errors.py", line 39, in create_api_error_from_http_exception raise cls(e, response=response, explanation=explanation) from e docker.errors.APIError: 500 Server Error for http+docker://localhost/v1.44/images/create?tag=v0.1&fromImage=ghcr.io%2Fopendevin%2Fsandbox: Internal Server Error ("denied")

dingi · 2024-03-24T07:35:57 1711265757

[flagged]

Udo · 2024-03-24T08:14:33 1711268073

Seems to me that the most subversive thing you can do in a gold rush is to open source the shovels.

mhuesler · 2024-03-24T11:42:46 1711280566

fuck you devin

captainbland · 2024-03-24T11:46:44 1711280804

Ach, programmers! You ruined programming!