Spam accounts overwhelmed my database. Claude found the weaknesses, Codex wrote the fixes, and I deployed a new defense.
New benchmarks show semantic code graphs helping coding agents find change locations faster and complete updates more ...