The code-agent closed loop continues to deepen
Compared with the “repo-level closed loop” described in Code agents enter real engineering loops (2026-W10), which was built around RAIM, BeyondSWE, and Echo, this thread continued to strengthen this week, but its center of gravity expanded from repository execution to the training and release processes themselves. SWE-Fuse pushes a 32B open-source model to 60.2% on SWE-bench Verified, suggesting that gains increasingly come from trajectory design and weakly supervised repair training. Understanding by Reconstruction then uses trajectories of requirements, planning, reading, writing, and debugging for continued pretraining, and ExecVerify goes further by adding verifiable stepwise rewards to code execution reasoning. By the weekend, LLM-Augmented Release Intelligence had reduced submission input volume by 40–60% on a platform with 60+ tasks and 20+ pipelines, showing that the closed loop has extended from bug fixing toward release collaboration.