
Regardless of the hype about these brokers being co-workers, from our experience, these brokers are likely to work finest in case you consider them as instruments that amplify present abilities, not because the autonomous co-workers the advertising and marketing language implies. They will produce spectacular drafts quick however nonetheless require fixed human course-correction.
The Frontier launch got here simply three days after OpenAI released a brand new macOS desktop app for Codex, its AI coding software, which OpenAI executives described as a “command heart for brokers.” The Codex app lets builders run a number of agent threads in parallel, every engaged on an remoted copy of a codebase by way of Git worktrees.
OpenAI additionally launched GPT-5.3-Codex on Thursday, a brand new AI mannequin that powers the Codex app. OpenAI claims that the Codex group used early variations of GPT-5.3-Codex to debug the mannequin’s personal coaching run, handle its deployment, and diagnose check outcomes, just like what OpenAI told Ars Technica in a December interview.
“Our group was blown away by how a lot Codex was in a position to speed up its personal growth,” the corporate wrote. On Terminal-Bench 2.0, the agentic coding benchmark, GPT-5.3-Codex scored 77.3%, which exceeds Anthropic’s just-released Opus 4.6 by about 12 proportion factors.
The frequent thread throughout all of those merchandise is a shift within the person’s function. Somewhat than merely typing a immediate and ready for a single response, the developer or information employee turns into extra like a supervisor, dispatching duties, monitoring progress, and stepping in when an agent wants course.
On this imaginative and prescient, builders and information staff successfully change into center managers of AI. That’s, not writing the code or doing the evaluation themselves, however delegating duties, reviewing output, and hoping the brokers beneath them don’t quietly break issues. Whether or not that may come to go (or if it’s truly a good suggestion) continues to be extensively debated.