self-driving codebases


Self-Driving Codebases

Hi friend!

Today I'm sharing a few notes on my experiments around self-driving codebases, a phrase often used by the Cursor team. If you find this interesting, share it with a friend or colleague!

Thanks for reading,
Robert


Notes: Self-Driving Codebases

The idea of a self-driving codebase might sound abstract. But with recent releases from Cursor, Claude Code, and similar tools, it is becoming more concrete.

Across the major coding harnesses, the same shift is happening: coding agents are moving from interactive assistants inside an IDE to programmable infrastructure that can be triggered and managed programmatically.

In its current form, a software release- and feedback cycle in a typical company might look roughly like this:

Production errors and explicit user feedback usually move through several handoffs: support triages the issue, a PM or planning process turns it into a ticket, a developer investigates and writes the fix, and the change then moves through the release process before reaching users.

A self-driving codebase, in contrast, might look something like this:

Instead of turning feedback and errors into a long handoff process, they can trigger a cloud agent that investigates the issue, proposes a fix, and pushes the result into the CI/CD pipeline, with or without human review.

That trigger can live outside the codebase, for example as part of an automation workflow, or it can become part of the product itself. The second version is what I am currently exploring:

In my side project Readbetter, a tool that converts newsletters into Kindle-ready EPUB files, a recurring failure mode is that Amazon rejects a generated file. The only signal is a vague email saying the file could not be processed.

Previously, that email started a lengthy manual process: tracing the affected account, locating and downloading the EPUB from the cache, running validation scripts, inspecting the issue, finding the relevant code path, fixing the bug, testing, and shipping a new release.

Now, the email triggers a cloud agent.

Email parsing is already a core part of Readbetter, so I extended that flow with a function that can call the Cursor SDK. The failure email is parsed, the relevant context is extracted, and a Cursor cloud agent is started with a pre-configured environment.

The agent receives the context and runs through the same steps I previously handled manually, inside a sandboxed cloud environment. It then proposes a fix by pushing to a feature branch and opening a PR.

That is the lite version of a self-driving codebase: a real production failure becomes a structured coding task automatically.

In my own experience, Cursor currently offers the smoothest developer experience for setting this up with very little overhead. You could build a similar loop with Claude Managed Agents, or with more configuration on top of other coding harnesses.

The broader direction is clear: AI is no longer just helping inside the coding loop; it is starting to close the loop between production signals and code changes.

Headlines & Links

  • Apple Silicon costs more than OpenRouter – not surprising: running an open source model locally will often be costlier compared to the cloud based if you fully price in the hardware and electricity
  • Codex-maxxing - Jason from the Codex team wrote a post about how he uses Codex. What I found most interesting was the non-code related work, i.e. combining it w/ Slidev to create slidedecks, or leveraging computer-use to get an Amazon refund.

Robert Bouschery c/o Kit.com 600 1st Ave, Ste 330 PMB 92768, Seattle, WA 98104-2246
Unsubscribe · Preferences

The Hybrid Engineer

A newsletter for builders at the frontier, leveraging AI to bridge software, product, and knowledge work.

Read more from The Hybrid Engineer

Edition One Hi friend! Welcome to the first edition of The Hybrid Engineer. This newsletter is about what it looks like to work, build, and think with AI. I’ll cover the tools, workflows, experiments, and lessons I come across while using AI to write code, automate work, build products, and explore where software development and knowledge work are starting to merge. The “hybrid” part has two meanings: people are crossing old boundaries between technical and non-technical work, and all of us...