A Developer’s Review of 3 Popular AI Agents: ChatDev, SWE-Agent & Devin

Guilherme Assemany

Senior Developer

Artificial Intelligence has been shaking up nearly every industry, and software development is no exception. Companies are constantly looking to optimize their processes and reduce costs, and this is where the promise of integrating AI into the daily routine of software developers shines the brightest.

These tools not only boost developer productivity but also transform what we know about software development and its stages.

Table Of Contents

The Rise of AI in Software Development: From 1943 to Today
Current AI Tools and Technologies for Development
Benefits and Challenges of Using AI Agents
The Main Event: Comparing AI Agents ChatDev, SWE-Agent, and Devin
ChatDev
Devin AI
SWE-Agent
Final Thoughts on AI Agents
About the author: Guilherme Assemany

Imagine a world where bug fixing, code generation, and even sprint planning are done automatically and accurately. Does that sound too futuristic? It might, but this is the reality that many AI agents are proposing to bring into the present. Despite the excitement, however, adopting these agents comes with its own set of challenges and fundamental questions.

For me, the main questions are:

How effective are these AI dev agents, really?
Is it worth investing in them right now?
How do they compare to one another?

In this article, my goal is to dive into the world of AI agents for software development. I’ll start with an overview of these tools at large, discussing their capabilities, benefits, and challenges. Then I’ll provide a detailed analysis of three popular tools: ChatDev, SWE-Agent, and Devin. So, if you want to understand how artificial intelligence is reshaping the way we develop software, keep reading!

Table: key features and intended purposes of the three AI dev agents, ChatDev, SWE-Agent, and Devin.

The Rise of AI in Software Development: From 1943 to Today

The origins of artificial intelligence can be traced back to 1943, when Warren McCulloch and Walter Pitts created the first computational model for neural networks. At the time, “artificial intelligence” – a term that is overwhelmingly popular today – wasn’t used; however, this is still credited as the very foundation of AI.

Photo: Warren McCulloch and Walter Pitts, credited with creating the first computational model for neural networks.

In the 1990s, the first coding assistants emerged, including tools like IntelliSense, which helped developers write code faster and with fewer errors.

Starting in the mid-2010s, the use of machine learning and natural language processing (NLP) began to gain traction, allowing development tools to become smarter and more helpful in suggesting autocompletions, identifying areas for improvement in the code, and spotting potential bugs.

A visual timeline of the founding and development of ai tools — AI tools have come a long way since their founding in 1943, following a surge in advancement in the early 2010s.

Today, we are in an era where AI not only assists us, but also has the potential to co-create and suggest real-time solutions, elevating the developer’s role to a new level of productivity and efficiency.

Current AI Tools and Technologies for Development

Tools like Cursor and GitHub Copilot (the latter originally built on OpenAI’s Codex model) are widely used to autocomplete code, suggest changes, and serve as personalized development companions. Other platforms, like Tabnine and Replit Ghostwriter, offer assistants that adapt to the user’s coding style and provide context-aware project information.

Logos of popular code autocomplete tools and ai dev agents — Code autocomplete tools are already widely used by software developers. Today, we’re seeing the rise of AI Agents.

In addition, a new category of tools is emerging: AI agents designed not just to write code, but to interact with the developer, understand project context, provide detailed suggestions, and even help plan, execute, and test more complex tasks. These tools, which are the focus of this article, include ChatDev, SWE-Agent, and Devin.

Benefits and Challenges of Using AI Agents

The benefits of AI agents in software development are well discussed: increased productivity, fewer errors, and a more streamlined workflow. Developers can focus on more strategic tasks, leaving repetitive, tedious, or low-value tasks to AI. Additionally, these agents can act as virtual mentors, helping junior developers learn good coding practices, much like a productive pair programming session with a more experienced developer would.

However, there are significant challenges. Adopting AI agents raises several concerns, including:

Over-reliance on the technology
Privacy and data security issues
Whether the agents’ suggestions are genuinely useful and do not compromise the quality of the software produced

Indeed, each of these points could merit an entire article if discussed in depth, so I won’t delve too deeply into these more philosophical issues for now. Instead, let’s explore what AI dev agents can do.

Today’s AI Agents: What Can They Really Do?

AI agents are being used for a variety of tasks in software development, from writing and refactoring code to automated testing and even detecting vulnerabilities caused by poor development practices. Many companies are adopting these tools to improve team collaboration, accelerate delivery times, and reduce operational costs.

The promise of radically transforming the development workflow has stirred significant excitement in the tech community. The potential for continuous collaboration between humans and machines, where AI is not just a tool but a true coding partner, is driving interest and investment in this field. This vision of a more efficient and collaborative future is what fuels the enthusiasm surrounding these innovations.

The Main Event: Comparing AI Agents ChatDev, SWE-Agent, and Devin

I want to take a closer look at three AI agents for developers: ChatDev, SWE-Agent, and Devin AI. The goal here isn’t to directly compare these tools to declare a winner, mainly because, as you’ll see, they don’t all have the same focus or even the same implementation approach.

By the end of this text, my aim is for you to have a clearer understanding of these agents and a good starting point to explore which ones might make the most sense for your daily reality — and, of course, to decide if it’s worth investing your time in them now.

ChatDev

Introduction to ChatDev

Developed by OpenBMB (Open Lab for Big Model Base), ChatDev is one of the most intriguing AI agents I’ve encountered. The idea is to simulate a software company operated by various intelligent agents, each playing a different role within the organization. These roles include CEO, Chief Product Officer (CPO), Chief Technology Officer (CTO), programmer, reviewer, tester, and art designer. And of course, all these agents work together to bring your idea to life.

Each agent is responsible for specific tasks such as design, coding, testing, and documentation, creating an automated and cohesive workflow. The functionality of ChatDev is accompanied by a charming interface, which uses pixel-style drawings to visualize this virtual company.
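To make the hand-off between roles concrete, here is a minimal sketch of a role-based pipeline. It is an illustration only and not ChatDev’s actual implementation: the names Role, Artifact, and run_pipeline are hypothetical, and the lambdas stand in for what would be LLM calls in a real agent system.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Artifact:
    """Work product handed from one role to the next (hypothetical)."""
    task: str
    notes: List[str] = field(default_factory=list)


@dataclass
class Role:
    """A hypothetical agent role; `act` stands in for an LLM call."""
    name: str
    act: Callable[[Artifact], str]


def run_pipeline(task: str, roles: List[Role]) -> Artifact:
    """Pass the artifact through each role in order, recording its output."""
    artifact = Artifact(task=task)
    for role in roles:
        artifact.notes.append(f"{role.name}: {role.act(artifact)}")
    return artifact


# Placeholder behaviors; a real system would prompt a model per role.
roles = [
    Role("CEO", lambda a: f"approve scope for '{a.task}'"),
    Role("CTO", lambda a: "choose stack"),
    Role("Programmer", lambda a: "write code"),
    Role("Reviewer", lambda a: "review code"),
    Role("Tester", lambda a: "run tests"),
]

result = run_pipeline("to-do list app", roles)
for note in result.notes:
    print(note)
```

The point of the sketch is the shape of the workflow: each role sees what earlier roles produced and adds its own contribution, which is roughly what the pixel-art “company” visualizes.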

For now, using ChatDev is free, but you will need to integrate it with the OpenAI API, so you’ll have to provide an API key. ChatDev on GitHub boasts nearly 25,000 stars, making it one of the standout AI agents in the field.

Screenshot of ChatDev, a virtual software company with intelligent AI agents — ChatDev offers a virtual “software company” with agents that help build, test, and deploy code.

First Impressions: ChatDev

There are two ways to test ChatDev: via the web and via the terminal. For the web option, you need permission from the company, so if you’re eager to use it, I recommend following the instructions available in the project’s repository — it’s quite quick and simple.

I tested this tool through the terminal, and I won’t go over the installation steps since they are already well documented in the repo.

Once configured, the first step is to run a command to provide your instructions to our “virtual company.”

I’ll ask the AI to help me create a to-do application using Alpine.js. I won’t specify the framework version, so I can see how it handles that aspect as well. The prompt will be as follows:

Create a to-do list application using Alpine.js that allows users to add new to-do items through an input field and either pressing “Enter” or clicking an “Add” button, enabling deletion of individual to-do items by providing a “Delete” button next to each item, marking to-do items as completed by including a checkbox or similar mechanism that updates the item’s status visually upon selection, and implementing a filter feature to switch between showing all items or only those not yet completed. The app should update dynamically, reflecting changes instantly without requiring a page refresh, and should be built entirely with Alpine.js to handle all these functionalities.

Running a command in the Chatdev terminal — Entering a prompt in the ChatDev terminal effectively provides instructions to our “virtual company.”
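For reference, the terminal invocation can be sketched as follows. The run.py entry point with its --task and --name flags, and the OPENAI_API_KEY environment variable, follow the ChatDev README at the time of writing, so check the repository for the current syntax. The project name AlpineTodo is a hypothetical value, and the task string is only the opening words of the full prompt. The sketch builds and prints the command rather than running it.

```python
import os
import shlex
from typing import List, Mapping


def build_chatdev_command(task: str, name: str) -> List[str]:
    """Return the argv list for ChatDev's run.py (flags per its README)."""
    return ["python3", "run.py", "--task", task, "--name", name]


def has_api_key(env: Mapping[str, str]) -> bool:
    """True if a non-empty OPENAI_API_KEY is present in the environment."""
    return bool(env.get("OPENAI_API_KEY"))


# Opening words of the full prompt; paste the whole prompt in practice.
task = "Create a to-do list application using Alpine.js"
command = build_chatdev_command(task, name="AlpineTodo")  # hypothetical name

print("API key set:", has_api_key(os.environ))
print(shlex.join(command))  # shell-safe form you could paste into a terminal
```

Building the argument list first and quoting it with shlex.join avoids shell-quoting problems with a long prompt that contains quotation marks.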

After executing this command, you’ll see a stream of information scrolling through your terminal. It’s hard to keep up with everything in this view, and personally, I found it a bit frustrating, especially if you like to know the details of what’s happening.

A command being executed in the chatdev terminal — After running a command, ChatDev’s terminal shows a series of executions as individual lines.

But if you’re like me, there’s some good news: there’s a mini web application that allows you to “replay” every action, so you can see exactly what steps are taken to create your software. We’ll talk more about that in a moment, but first, I want to comment on the final result that’s displayed in the terminal:

Final result of Chatdev command in the terminal — The output of my command in my terminal. ChatDev summarizes what has been built, including lines of code, total duration, and number of files.

I genuinely found it quite impressive how ChatDev summarizes what has been built. You get insights into the real cost of producing the software (that is, what the run cost in OpenAI API usage), the number of files, lines of code, lines of documentation, total duration, and even the number of tokens used.

It’s true that some of the parameters seem to have glitches — for example, in this case, it reported 0 lines of code, when in reality, combining the HTML, JS, and CSS, we have around 100 lines.
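If you want to sanity-check a figure like this yourself, a few lines of Python are enough. This is a minimal sketch, assuming the generated project consists of .html, .js, and .css files and that counting non-blank lines is a fair definition of “lines of code”; the count_lines helper is hypothetical and not part of ChatDev. The demo runs against a throwaway directory, so point the function at your generated project folder instead.

```python
import tempfile
from pathlib import Path
from typing import Dict, Tuple


def count_lines(
    project_dir: Path,
    suffixes: Tuple[str, ...] = (".html", ".js", ".css"),
) -> Dict[str, int]:
    """Count non-blank lines per file suffix under project_dir."""
    totals: Dict[str, int] = {suffix: 0 for suffix in suffixes}
    for path in project_dir.rglob("*"):
        if path.is_file() and path.suffix in totals:
            text = path.read_text(encoding="utf-8", errors="replace")
            totals[path.suffix] += sum(
                1 for line in text.splitlines() if line.strip()
            )
    return totals


# Demo on a temporary directory with two tiny files.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    (root / "index.html").write_text("<html>\n\n</html>\n", encoding="utf-8")
    (root / "app.js").write_text("let x = 1;\n", encoding="utf-8")
    demo_totals = count_lines(root)
    print(demo_totals, "total:", sum(demo_totals.values()))
```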

Reviewing the Application

Now it’s time to check out the product that was created. Since our application is basically a web page, there’s no need for any configuration or environment setup — just open the main file in your browser.

Simple to-do list app generated with ChatDev — The app I built with ChatDev is a simple to-do list (without styling).

The generated app works quite well — you can add, delete, and mark to-dos as completed. Additionally, there’s a filtering feature. For some reason, ChatDev created the filter in a different view, which is why we see a “duplicated” list. Perhaps my prompt wasn’t specific enough.
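One way to think about the duplicated list: the filter only needs to be a derived view over the same items, not a second list. The sketch below models that state logic in Python purely for illustration. It is not the code ChatDev generated (which is Alpine.js), and the names Todo and TodoList are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Todo:
    text: str
    done: bool = False


@dataclass
class TodoList:
    items: List[Todo] = field(default_factory=list)
    show_only_open: bool = False  # the filter toggle

    def add(self, text: str) -> None:
        text = text.strip()
        if text:  # ignore empty input
            self.items.append(Todo(text))

    def delete(self, index: int) -> None:
        del self.items[index]

    def toggle(self, index: int) -> None:
        self.items[index].done = not self.items[index].done

    def visible(self) -> List[Todo]:
        """Derived view: the same items, narrowed by the active filter."""
        if self.show_only_open:
            return [todo for todo in self.items if not todo.done]
        return list(self.items)


todos = TodoList()
todos.add("write review")
todos.add("test Devin")
todos.toggle(0)              # mark the first item as completed
todos.show_only_open = True  # switch the filter on
print([todo.text for todo in todos.visible()])  # ['test Devin']
```

With this shape, the page renders a single list from visible(), and switching the filter changes what is shown without duplicating anything.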

Visually, the application is very basic, but that’s not an issue; the prompt didn’t mention anything about making the app look nice, just about the functionalities and using Alpine.js.

One thing that caught my attention is that the version of Alpine.js used was 2.8 rather than the latest release, which at the time of this writing is 3.14.1.
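A quick way to spot which version a generated page pins is to scan its script tags. This is a minimal sketch, assuming Alpine.js is loaded from a CDN URL that embeds the version after an @ sign; the alpine_version helper is hypothetical, and the sample tag is an assumed example in the style of the Alpine v2 CDN URL, not copied from the generated file.

```python
import re
from typing import Optional, Tuple

# Matches e.g. "alpinejs@3.14.1" or "alpine@v2.8.0" inside a script URL.
ALPINE_VERSION = re.compile(r"alpine(?:js)?@v?(\d+(?:\.\d+)*)")


def alpine_version(html: str) -> Optional[Tuple[int, ...]]:
    """Return the pinned Alpine.js version found in html, or None."""
    match = ALPINE_VERSION.search(html)
    if match is None:
        return None
    return tuple(int(part) for part in match.group(1).split("."))


# Assumed example of a v2-style script tag.
sample = (
    '<script src="https://cdn.jsdelivr.net/gh/alpinejs/'
    'alpine@v2.8.0/dist/alpine.min.js" defer></script>'
)
found = alpine_version(sample)
print(found, "older than 3.x:", found is not None and found < (3,))
```

Comparing versions as integer tuples avoids the string-comparison trap where "3.14.1" sorts before "3.2.0".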

Overall, the result was well-written code — simple and straight to the point. But what I liked most was the manual.md file, which contains instructions for running and understanding the app’s functionalities. Of course, for a to-do list, this might not be necessary for most users, but imagine the power of this in a larger application! Creating documentation becomes a much simpler task.

Inside the Process

Remember I mentioned the “Replay” feature for the project-building process? This is possible through a web interface specifically designed for that. In my environment, I ran the command python3 visualizer/app.py, and I quickly accessed this tool. Along with the generated project, you’ll always have a LOG file, and this file is what you use to replay the process.

ChatDev replay feature showing a LOG file: ChatDev has a built-in “Replay” feature that lets you step through the project’s LOG file.

ChatDev Wrap-Up

ChatDev is a tool that is quite easy to use if you have some background in development. With the standard usage, you can’t actually interact with your virtual company to refine instructions; for that, you need some additional setup. Another feature I missed is the ability to extend or edit already completed projects.

In its current stage, it is certainly an interesting tool that can help you quickly create certain functionalities or proofs of concept.

What I liked the most was how straightforward it is to get started with the tool, as well as the comprehensive documentation provided. I think ChatDev is a promising tool that is definitely worth keeping an eye on. Additionally, it’s kind of fun to see it in action and get an idea of how the agents collaborate with one another in their assigned roles within the company.

Devin AI

Introduction to Devin AI

Devin AI is marketed as “the world’s first AI software engineer.” It’s an innovative AI agent designed for collaborative development, offering a community-centered approach where developers can share tips, best practices, and even AI-generated code snippets, fostering a learning and collaboration environment.

First Impressions

When I wrote this article, Devin was free to try: all I had to do was request access to the web interface through a form.

In December 2024, however, Cognition (the team behind Devin) made the AI agent generally available, with plans starting at $500/month. It’s a bummer for solo developers who want to test it out, though the paid plan does give your entire team access. With that release, Devin also gained the ability to integrate with your team’s GitHub and Slack.

Currently, Devin is only available via a web application. You’ll have access to an interface similar to ChatGPT, but more specialized, with features such as a dedicated terminal for your project, a browser to view web applications (if applicable), and even a code editor so you can build your application alongside Devin.

To create a better basis for comparison, I’ll use the exact same prompt I gave to ChatDev earlier.

Screenshot of Devin, AI Dev Agent, interface — Devin AI is a web-based tool with an interface similar to ChatGPT.

As you might have noticed, Devin’s initial interface feels quite similar to ChatGPT’s — it looks like just a space to enter your prompt and wait for the output, right? However, once Devin starts working on your project, it becomes clear that it offers much more than that, and this is one of my favorite aspects of Devin.

Devin AI interface after entering a prompt — Once Devin begins working on a project, you’ll find four main views: shell, browser, editor, and planner.

As soon as Devin begins working, you’ll notice some options appear in the interface.

You have four main views:

Shell
Browser
Editor (An online VS Code where you can actually edit the code and keep iterating with Devin to build your final product)
Planner (Which explains each step defined to break down the feature you’ve requested)

The best part is that you can follow along in real time with what Devin is doing. If, at any point, you add more information to the prompt, Devin will take that into account and adjust its work accordingly.

The potential of this tool is simply fantastic.