GPT-5.4 Accidentally Leaked in OpenAI Codex Repository: What the Code Reveals
Developers spotted GPT-5.4 references twice in OpenAI's public Codex repo pull requests, revealing a 2-million-token context window and full-resolution image handling before the code was scrubbed.
Developers spotted GPT-5.4 references twice in OpenAI's public Codex repo pull requests, revealing a 2-million-token context window and full-resolution image handling before the code was scrubbed.
A Model Name That Was Not Supposed to Be Public
On February 28, 2026, a pull request appeared in OpenAI's public Codex GitHub repository containing something developers were not meant to see: a version cutoff reading "GPT-5.4 or newer" for an image handling feature. The reference was quickly scrubbed via force pushes and edits, but not before X user @scaling01 flagged it publicly. Then on March 2, a second Codex pull request surfaced with a direct function call using gpt-5.4 as the model argument, plus a slash command explicitly described as "toggle Fast mode for GPT-5.4." An OpenAI employee named Tibo compounded the situation by accidentally posting a screenshot on X that included GPT-5.4 in the model picker before deleting it.
These are not speculative rumors. They are code artifacts from OpenAI's own repositories, providing concrete evidence that GPT-5.4 exists internally and is actively being developed for production deployment.
What the Leaked Code Reveals
Two-Million-Token Context Window
The most significant technical detail extracted from the leaked code is a context window of 2 million tokens. For perspective, GPT-5.3 Codex operates with a 256,000-token context window. An eight-fold expansion to 2 million tokens would place GPT-5.4 in territory currently occupied only by Google's Gemini 3.1 Pro, which offers a 1-million-token window. If the 2-million-token figure holds, GPT-5.4 would set a new record among commercially available frontier models.
A context window of this size has practical implications for enterprise users. Legal firms could process entire case files in a single conversation. Software development teams could load complete codebases for analysis. Research teams could feed entire paper corpora for synthesis. The shift from hundreds of thousands to millions of tokens is not incremental—it fundamentally changes what tasks are feasible in a single model interaction.
Full-Resolution Image Handling
The leaked pull request adds a feature flag that preserves original PNG, JPEG, and WebP image bytes, passing them directly to the Responses API without compression. Current GPT models compress uploaded images before processing, which can degrade analysis quality for tasks requiring pixel-level precision—medical imaging, satellite imagery analysis, document OCR, and design review among them.
The code sets the minimum model version to (5, 4), meaning this feature explicitly requires GPT-5.4 or newer. This is not a general capability being backported; it is a GPT-5.4-specific enhancement.
Fast Mode Toggle
The slash command for "toggle Fast mode for GPT-5.4" suggests OpenAI is building a speed-optimized inference path, likely similar to the existing approach where models offer different speed-quality tradeoffs. This aligns with OpenAI's recent pattern of releasing both standard and optimized variants of each model generation.
Timeline and Release Probability
OpenAI has maintained an accelerating release cadence within the GPT-5 series:
| Model | Release Date |
|---|---|
| GPT-5 | August 2025 |
| GPT-5.1 | November 2025 |
| GPT-5.2 | January 2026 |
| GPT-5.3 Codex | February 5, 2026 |
The interval between releases has compressed from three months to roughly one month. If this pace continues, a March or early April release for GPT-5.4 is plausible.
Prediction markets on Manifold currently give GPT-5.4 a 55% probability of shipping before April 2026 and 74% before June. However, shipping both GPT-5.3 and GPT-5.4 within the same month would require unprecedented acceleration even by OpenAI's recent standards.
OpenAI has made no official announcement regarding GPT-5.4. The company has not commented on the leaked code references.
Competitive Context
The timing of these leaks is notable. OpenAI is navigating a period of significant public relations pressure following its Pentagon classified network deal, which has triggered the "Cancel ChatGPT" movement and driven measurable user migration to Anthropic's Claude. A major model release could serve as both a technical milestone and a strategic counter-narrative.
Meanwhile, the competitive landscape has intensified. Google's Gemini 3.1 Pro already offers a 1-million-token context window with strong multimodal performance. Anthropic's Claude Opus 4.6 has been drawing praise for coding and reasoning capabilities. A 2-million-token GPT-5.4 with enhanced vision capabilities would represent OpenAI's bid to reclaim technical leadership on multiple fronts simultaneously.
What This Means for Developers
For developers building on the OpenAI API, the leaked features suggest several preparation steps. Applications that currently chunk documents to fit within context limits may need architectural revisions to take advantage of million-scale windows. Image processing pipelines that pre-compress inputs before API submission may benefit from passing raw bytes directly. Teams evaluating multi-model strategies should factor in the possibility that GPT-5.4 could narrow or close capability gaps that currently favor competitors in specific domains.
The Fast mode toggle also suggests that OpenAI is continuing to invest in inference optimization, which could translate to lower latency and potentially lower costs for real-time applications.
Conclusion
The GPT-5.4 leak is not a roadmap slide or a conference teaser. It is production code referencing a specific model version with specific capabilities—a 2-million-token context window, full-resolution image handling, and a fast inference mode. Whether the model ships in March, April, or later, the code confirms that OpenAI is building the next iteration of its flagship model with substantial capability expansions. Developers and enterprise users should monitor OpenAI's official channels for an announcement that, based on the evidence, appears to be a matter of weeks rather than months away.
Editor's Verdict
GPT-5.4 Accidentally Leaked in OpenAI Codex Repository: What the Code Reveals earns a solid recommendation within the gpt space.
The strongest case for paying attention is A 2-million-token context window would set a new commercial frontier model record, enabling entirely new categories of single-session tasks, which raises the bar for what readers should now expect from peers in this space. Reinforcing that, full-resolution image handling removes a significant limitation for vision-dependent professional workflows adds practical value rather than just headline appeal. The broader signal worth registering is straightforward: two separate Codex pull requests and an employee screenshot confirm GPT-5.4 exists as an internal OpenAI model, not speculation. On the other side of the ledger, no official announcement from OpenAI; all information derives from leaked code that may not reflect final specifications is a real constraint, not a marketing footnote, and it should factor into any serious decision. Layered on top of that, A 2-million-token context window will likely carry significant cost implications for API users narrows the set of teams for whom this is an obvious yes.
For ChatGPT power users, OpenAI API customers, and enterprise teams already running on the OpenAI stack, this is a serious evaluation candidate, not just a curiosity to bookmark. For everyone else, the safer posture is to monitor coverage and revisit once the use cases that matter to your team are demonstrated in the wild.
Pros
- A 2-million-token context window would set a new commercial frontier model record, enabling entirely new categories of single-session tasks
- Full-resolution image handling removes a significant limitation for vision-dependent professional workflows
- Fast mode toggle suggests users will have speed-quality flexibility without switching models
- Accelerating release cadence demonstrates OpenAI's engineering velocity and competitive responsiveness
- Code-level evidence provides more reliable insight than roadmap speculation or conference previews
Cons
- No official announcement from OpenAI; all information derives from leaked code that may not reflect final specifications
- A 2-million-token context window will likely carry significant cost implications for API users
- Shipping GPT-5.4 weeks after GPT-5.3 could fragment developer attention and create migration fatigue
- Enhanced capabilities do not address the trust and ethical concerns driving the current Cancel ChatGPT movement
References
Comments0
Key Features
GPT-5.4 was discovered twice in OpenAI's public Codex GitHub repository pull requests on February 28 and March 2, 2026. The leaked code reveals a 2-million-token context window (8x larger than GPT-5.3's 256K), full-resolution image handling that preserves original PNG/JPEG/WebP bytes without compression, and a Fast mode toggle for speed-optimized inference. Prediction markets give the model a 55% chance of launching before April 2026.
Key Insights
- Two separate Codex pull requests and an employee screenshot confirm GPT-5.4 exists as an internal OpenAI model, not speculation
- The 2-million-token context window would double Gemini 3.1 Pro's 1M-token record and represent an 8x expansion over GPT-5.3 Codex
- Full-resolution image handling eliminates compression artifacts, enabling pixel-level analysis for medical imaging, satellite data, and document OCR
- The Fast mode toggle indicates OpenAI is building speed-optimized inference paths, suggesting both standard and rapid variants at launch
- OpenAI's release cadence has compressed from 3-month to 1-month intervals within the GPT-5 series, making a March-April launch plausible
- Prediction markets assign 55% probability to a pre-April release and 74% to pre-June, reflecting high confidence in near-term availability
- The leak arrives amid the Cancel ChatGPT movement, creating strategic pressure for OpenAI to deliver a compelling technical counter-narrative
- Enterprise applications involving legal case files, full codebases, and research corpora would benefit most from million-scale context windows
Was this review helpful?
Share
Related AI Reviews
ChatGPT Personal Finance Launches: Connect Your Bank for AI-Powered Money Advice
OpenAI launched a personal finance dashboard in ChatGPT Pro on May 15, integrating with 12,000+ banks via Plaid to offer AI-driven spending analysis and multi-year financial planning.
OpenAI Brings Codex to iOS and Android: Remote Coding Workflows From Your Phone
OpenAI launched Codex on the ChatGPT mobile app for iOS and Android on May 14, 2026, enabling developers to monitor, approve, and manage AI coding agents remotely with new Hooks and Remote SSH features.
OpenAI Daybreak: GPT-5.5-Cyber Brings AI to Offensive Security Testing
OpenAI launched Daybreak on May 12, 2026, a cybersecurity platform pairing GPT-5.5-Cyber with Codex Security for automated threat modeling, vulnerability discovery, and patch validation in authorized environments.
OpenAI Workspace Agents: ChatGPT Becomes a Persistent Enterprise Collaborator
OpenAI launches Workspace Agents in ChatGPT, enabling teams to build persistent cloud-based AI agents that automate workflows across Slack, Salesforce, and internal tools.
