With the popularity of AI coding tools rising among some software developers, their adoption has begun to touch every aspect of the process, including the improvement of AI coding tools themselves.
The SWE-Bench Verified evaluation is basically a test of AI processing accuracy. It measures how well the AI solves a set of coding problems. According to OpenAI, GPT-5.1-Codex-Max "reaches the same ...
OpenAI has claimed that it built and shipped the Sora Android app in just 28 days, relying heavily on its AI coding agent, Codex. The company said the initial production version of Sora for Android ...
OpenAI has shipped new products at a relentless clip in the second half of 2025. Not only has the company released several new AI models, but also new features within ChatGPT, an AI-powered web ...
Check Point Research has found a flaw in OpenAI’s AI coding tool, Codex, that would allow bad actors to exfiltrate data ...
A major supply chain vulnerability in the OpenAI Codex CLI has been patched after discovery by Check Point Research.
American AI giants are backing a new effort to establish open standards for building agentic software and tools.
OpenAI launched Codex, an AI tool to write codes and fix bugs for developers. As an AI Agent, Codex could also help users with an Amazon order or a dinner reservation. Codex and GPT-4.5, which was ...
OpenAI has reported a surge in performance as GPT-5.1-Codex-Max reaching 76% in capability assessments, and warned of ...
OpenAI says the cyber capabilities of its frontier AI models are accelerating and warns Wednesday that upcoming models are likely to pose a "high" risk, according to a report shared first with Axios.
For the first time, OpenAI is publishing a report describing the use of its own products in companies. It is of little help ...