agents write code 10x faster but debugging is the same speed it was in 2019

Posted by notomarsol@reddit | ExperiencedDevs | View on Reddit | 13 comments

something broke in prod yesterday. by the time i opened slack there were already messages about it, and 4 PRs from the last day that could be the cause, all of them passed CI, all of them got approved, all of them looked clean when i reviewed them.

it took me 3 hours to find it. race condition in a queue handler that only shows up when two events fire within about 40ms of each other.

the agent that wrote the code couldn't help me figure out why it broke. i tried, multiple times, different prompts. the bug wasn't in the code it generated, it was in how that code talked to another service from 8 months ago that someone who already left wrote. you cant prompt your way out of "this requires understanding the full system."

and this is where i'm losing it a little. the dashboards say we"re shipping 2-3x more PRs per week now. our leadership keeps sending around the metrics. but the time i actually spend AT MY DESK has barely changed, it just got rearranged. i write less code, i sit in more incident reviews

we have all the AI stuff btw. Claude Code, Cursor, CodeRabbit, Sentry, Datadog, the usual. coderabbit catches the obvious stuff at PR time which is great because it means the bugs that DO ship are the weird ones, the system-interaction stuff, the race conditions, the cross-service timing things. that part is still me at 2am with coffee, scrolling through traces

ok so the post-incident work has gotten HARDER not easier. when juniors wrote bad code i could usually figure out their reasoning by reading the code, even if the reasoning was wrong it was there. when an agent writes bad code there's no reasoning to recover. it just pattern-matched something that almost worked

different shape of problem entirely. yeah idk. our speed is up, defect rate at first review is down. but the average time it takes me to resolve incidents has gone up and nobody wants to talk about that one because its not on the dashboard