Can the agent push directly to my main branch?

No. Repository changes happen only through a reviewed pull request. The agent edits inside a sandboxed clone and opens a PR; your protected branches are never written to directly.

How do I know the change actually works before I review it?

The change is verified in the sandbox before the pull request is opened — typecheck, tests, and build must pass. The PR you receive has already cleared those checks.

How does the agent avoid making huge, unreviewable diffs?

It indexes your codebase both semantically and with symbol awareness, so it can locate exactly what needs to change and update every usage. Changes stay surgical and focused rather than rewriting whole files.

What happens if an agent change turns out to be wrong?

Because every change lands as a pull request, it is fully attributable and revertible through your normal Git workflow. You revert the PR exactly as you would for any contributor.

Why an AI Coding Agent Should Only Touch Your Repo Through a Reviewed Pull Request

There is a tempting but dangerous mental model of AI coding agents: a smart entity with write access to your repository, quietly improving things. It is dangerous because it collapses two things that should stay separate — capability and authority. An agent can be extremely capable and still have no authority to change your main branch directly. The mechanism that keeps those separate is the same one you already trust for human engineers: the reviewed pull request.

The trust boundary you already have

Mature engineering teams almost never push directly to main. Work happens on a branch, it is opened as a pull request, it runs through CI, and a human reviews it before it merges. This is not bureaucracy — it is a trust boundary. It means no single actor, however senior, can unilaterally change production code without a second set of eyes and an automated check.

An AI agent should sit on the same side of that boundary as every other contributor. The agent proposes; the pull request is the proposal; you decide. Anything less asks you to extend more trust to an automated system than you would extend to a human teammate, which is exactly backwards.

The goal is not an agent you have to trust blindly. It is an agent whose work you never have to trust blindly, because every change arrives as a reviewable diff.

What "edits in a sandbox" actually buys you

Before a pull request can be opened, the work has to happen somewhere. The right somewhere is a sandboxed clone of your repository — an isolated copy where the agent can read, edit, and run code without any of it touching your real branches. This matters for two reasons.

Containment: a mistake in the sandbox is a mistake in a throwaway copy. Nothing the agent does there can corrupt your history, leak into main, or break a teammate's checkout.
Verification: a sandbox is a place to actually run things. The agent can typecheck, run your tests, and produce a build — and prove the change works before proposing it, rather than hoping it does.

That second point is the difference between a suggestion and a verified change. An agent that edits in a sandbox and runs your checks is not guessing whether its diff compiles; it has watched it compile.

Indexing the code so the changes are surgical

A pull request is only safe to review if it is small and focused. Sprawling, scattershot diffs are hard to reason about, which defeats the purpose of review. To produce tight changes, the agent first indexes your codebase — both semantically (what the code means) and symbol-aware (where things are defined and used). That index is what lets it find the three functions that actually need to change instead of rewriting a file out of caution.

Why symbol-awareness matters for safety

Renaming a function is trivial until you realize it is called in forty places. A symbol-aware index knows about all forty. The agent can make a change that is complete — every caller updated — without the change being broad. Completeness and minimalism stop being in tension.

The verification gate

Before a pull request is opened, the change runs a gate: typecheck, tests, and build. If any of them fail, that is signal, not noise — the agent can iterate inside the sandbox until the change is green, or surface the failure to you. The pull request you receive is one that has already cleared the same bar you would expect from a careful human contributor.

The lifecycle of an agent change

connect repo   → grant access to a GitHub repository
index          → semantic + symbol-aware understanding of the code
sandbox clone  → isolated copy; main branch is never touched directly
edit           → surgical changes scoped to the request
verify         → typecheck + tests + build must pass
open PR        → a reviewed pull request is the only path to your repo
you review     → approve, request changes, or close

Auditability as a first-class feature

Because the only way into your repository is a pull request, your Git history becomes a complete, honest record of what the agent did and when. There are no out-of-band changes to reconcile, no "wait, who edited this?" moments. Every agent contribution is attributable, reviewable, and revertible through the exact same tools you already use. If a change turns out to be wrong, you revert the PR — the same as you would for anyone.

No direct writes to protected branches — ever.
Every change is a diff you can read before it lands.
CI-equivalent checks run before the PR reaches you.
Full attribution and easy revert through normal Git workflow.

The cultural payoff

When an agent is constrained to the pull-request workflow, adopting it does not require a leap of faith from your team. You are not being asked to let a robot into production. You are being offered a tireless contributor that follows your existing rules — branch, verify, propose, review. The fastest way to get an engineering team comfortable with AI assistance is to make the AI play by the same rules everyone else already follows.

Speed and safety are usually framed as a trade-off. Here they are not. Sandboxed edits and automated verification make the agent fast; the reviewed pull request makes it safe. You get both because they live in different parts of the pipeline.