Essay

What actually breaks after vibe coding

Not the demo. Not the first happy-path click. The parts that break are the ones product trust quietly depends on: auth, billing, environment assumptions, failure recovery, and ownership.

1. Auth gets weird before founders can explain why

Sign-up flows often look complete until they meet edge cases: role confusion, recovery paths, invite logic, or mismatched assumptions between frontend state and backend enforcement.

The founder experiences this as “auth is flaky.” The deeper issue is that authentication is rarely just one thing. It is a chain of decisions, and AI-assisted building often accelerates implementation faster than that chain is being reasoned about.

2. Billing exists, but nobody trusts it

Billing is one of the most common “looks done, feels unsafe” systems in half-built apps. The UI may be there. The provider may be connected. The problem is everything around the happy path: failure states, entitlement changes, retries, cancellations, downgrades, and the operational visibility to know when something quietly went wrong.

3. Deploys turn product confidence into roulette

The app works locally. Then staging behaves differently. Then production depends on environment variables, migration order, or infrastructure assumptions nobody wrote down clearly.

At that point, shipping becomes emotional. Teams are not only asking “does this work?” They are asking “do I trust us to touch this?”

4. The code becomes socially fragile

The product might still be valuable, but fewer and fewer people feel confident changing it. This is one of the hidden costs of AI-assisted speed: the app can be real before ownership is real.

That is when every small change starts carrying too much psychological weight.

The practical takeaway

The first version of an AI-built app usually does not fail because the idea is bad. It fails because confidence, observability, and decision quality did not keep pace with implementation speed. The fix is rarely “throw it all away.” The fix is usually a sharper diagnosis of what is salvageable, what is dangerous, and what actually has to be hardened before the app deserves more trust.