Code intelligence evaluation shifts toward real engineering: end-to-end delivery, long-term maintenance, and production supervision advance together
Today's code research converges on one theme: evaluation is moving closer to real software engineering. Papers are no longer content to ask whether a model can "solve a single problem correctly," but…