TL;DR
Google confirmed an AI-generated 2FA bypass exploit the same day OpenAI launched Daybreak, its new agentic cybersecurity platform built on GPT-5.5 and Codex Security.
On May 11, Google's Threat Intelligence Group published what amounts to a before-and-after moment for software security: a Python script, written with the assistance of artificial intelligence, designed to bypass two-factor authentication on a widely-used web administration tool. A prominent cybercrime group had intended to weaponize it in a mass exploitation campaign. Google alerted the vendor in time, but the code already existed.
The same day, OpenAI shipped Daybreak.
The timing was not coordinated. That is precisely the point. Daybreak is OpenAI's first dedicated enterprise security platform, embedding GPT-5.5 models and a purpose-built Codex Security engine directly into the software development lifecycle, from pull requests to production deployments. The coincidence of the two announcements, as reported by ibtimes.com, clarified something the industry had been debating in the abstract: the attacker-defender race in artificial intelligence has moved from hypothetical to operational.
The significance of the zero-day
The attack's importance is not in the specific target. Web-based system administration tools get patched. What matters is the process shift. Writing a functional exploit has traditionally required deep expertise, significant time, and access to specialized knowledge. A working AI-generated 2FA bypass changes the cost structure of that activity, potentially lowering the barrier for threat actors who previously lacked that technical depth.
John Hultquist, chief analyst at Google's Threat Intelligence Group, described the situation plainly: "The game's already begun and we expect the capability trajectory is pretty sharp." He was characterizing attacker capabilities. Daybreak is OpenAI's attempt to redirect that trajectory toward defense.
What Daybreak actually does
Daybreak positions itself as the defender's answer to that trajectory. Rather than offering security scanning as a bolt-on product, the platform integrates into the development workflow, flagging vulnerabilities as code is written and reviewed, not after it ships. The Codex Security engine, running on GPT-5.5 models, is intended to surface exploitable patterns at the point where fixing them is cheapest.
This is OpenAI's most direct move into enterprise security. It arrives four weeks after Anthropic unveiled Project Glasswing and its Claude Mythos Preview model, covering similar ground. Within six weeks, both dominant AI research labs had shipped dedicated security products. The release cadence tracked by llm-stats.com shows GPT-5.5 and GPT-5.5 Pro arriving in late April, with Daybreak following shortly after. The commercial logic is straightforward: enterprise security is a regulated, high-stakes market where incumbents who can demonstrate auditability gain a durable advantage.
Context: a crowded six weeks
The broader model landscape shifted considerably in the weeks surrounding these announcements. DeepSeek released V4-Pro in late April, claiming a one-million-token context window and benchmark performance competitive with top closed-source models, as covered by Yahoo Finance. Pricing data from Price Per Token shows V4-Pro available at $2.40 per million input tokens across multiple providers. xAI shipped Grok 4.3. The AI Release Tracker now covers 155 frontier models across major labs, and the release pace has not slowed.
But the AI-built zero-day changes the frame for all of that activity. Capability gains in general-purpose models translate, with relatively minor adaptation, into improved performance on offensive security tasks. The same properties that make artificial intelligence faster at code generation make it faster at exploit generation. Google's disclosure proves this has moved from theoretical to practical.
What this means for practitioners
For ML engineers and applied scientists working on security tooling, Daybreak and Glasswing represent a bet that AI can scale defense faster than attackers can scale offense, by automating vulnerability detection across millions of lines of code. Whether that holds depends on two things neither platform has yet demonstrated publicly: false positive rates at production scale, and the capacity to detect novel attack patterns rather than known vulnerability categories.
Public evaluation data for both platforms remains thin. What exists is a plausible hypothesis, a real threat that materialized on the same day as the product meant to address it, and an open question the industry has not answered.
If a moderately resourced threat actor can already use artificial intelligence to build working exploits, the more pressing question for practitioners is not whether AI-powered defense works at all, but how long until the attack side removes the human from its own loop entirely.
FAQ
Q: What is OpenAI Daybreak?
A: Daybreak is OpenAI's enterprise cybersecurity platform, integrating GPT-5.5 models and a Codex Security engine into software development pipelines to identify vulnerabilities during code review and deployment, rather than after code ships.
Q: What was the first confirmed AI-built zero-day exploit?
A: On May 11, 2026, Google's Threat Intelligence Group disclosed a Python script believed to have been created with AI assistance, capable of bypassing two-factor authentication on a popular web-based administration tool. The attack was disrupted before deployment.
Q: How does Daybreak compare to Anthropic's Project Glasswing?
A: Both embed AI security capabilities into development workflows. Glasswing uses Claude Mythos Preview; Daybreak runs on GPT-5.5 and Codex Security. Daybreak arrived four weeks later. Neither platform has released detailed public benchmarks for false positive rates or novel exploit detection.
Q: Can AI models write working cyberattack tools?
A: Yes, as Google's May 2026 disclosure confirmed. AI systems can assist in writing functional exploits, reducing the technical barrier for threat actors, which is why the parallel emergence of AI-powered defensive platforms is being taken seriously by security practitioners.
About the Author
Guilherme A.
Former dentist (MD) from Brazil, 41 years old, husband, and AI enthusiast. In 2020, he transitioned from a decade-long career in dentistry to pursue his passion for technology, entrepreneurship, and helping others grow.
Connect on LinkedIn