Disclosure Important reader notice
Important reader notice
This article is for general informational and educational purposes only. It is not legal, financial, tax, medical, security, compliance, or other professional advice, and you should not rely on it as a substitute for advice from a qualified professional who understands your specific situation.
AI tools, pricing, features, policies, laws, and platform terms can change quickly. We work to keep content accurate, but we do not guarantee that every detail is current, complete, or suitable for your use case. Always verify important claims with the original source before making business, legal, financial, safety, or purchasing decisions.
Some links may be affiliate, partner, or sponsored links. If you buy through them, AIUnpacking may earn compensation at no extra cost to you. Sponsored relationships are disclosed where applicable, and compensation does not override our editorial judgment.
DALL-E is dead. GPT Image 2 is now OpenAI's flagship image model and ranks #1 on every major leaderboard - but it still doesn't beat Midjourney for pure artistic style.
- #1 on LM Arena Text-to-Image leaderboard with record-breaking 93% win rate
- Near-perfect multilingual text rendering (Japanese, Korean, Chinese, Hindi, Bengali)
- Thinking Mode plans layout before rendering for complex multi-image outputs
- Native 2K resolution with experimental 4K support
- ChatGPT integration makes prompting conversational and iterative
- DALL-E brand is officially dead - all API endpoints shut down May 12, 2026
- Midjourney V7 still produces more aesthetically distinctive art
- Free tier severely limited; real work requires Plus ($20/month) or Pro ($200/month)
- Generated text still needs proofreading despite 99% accuracy claims
- Style range narrower than Midjourney; less suited for highly stylized creative work
DALL-E Is Dead in 2026: Here’s What OpenAI Replaced It With
On May 12, 2026, OpenAI permanently pulled the plug on DALL-E 2 and DALL-E 3. Every API endpoint stopped responding. Every developer who hadn’t migrated got cut off. The DALL-E brand - the name that introduced millions of people to AI image generation - is officially retired.
I’ve been tracking this transition since OpenAI first signaled it back in November 2025, and now that the dust has settled, I want to walk you through what actually happened, what replaced it, and whether the new stuff is any good.
The Timeline: How DALL-E Died
The DALL-E era spanned just over five years. Here’s the arc, because understanding how fast this moved tells you something important about AI image generation in 2026.
January 5, 2021 - OpenAI announced DALL-E 1, a 12-billion-parameter version of GPT-3 trained to generate images from text. It was a research demo more than a product, but it captured imaginations instantly.
April 6, 2022 - DALL-E 2 launched publicly. It could generate 1024×1024 images, do inpainting, and produce variations. This was the model that turned AI image generation into a mainstream conversation.
October 2023 - DALL-E 3 arrived, natively integrated into ChatGPT. It was a genuine leap forward: far better prompt adherence, dramatically improved text rendering inside images, and the conversational ChatGPT workflow that made prompting feel like talking to a designer rather than writing code.
March 25, 2025 - The moment the DALL-E name started dying. OpenAI flipped ChatGPT’s image generator from a separate DALL-E 3 model to GPT-4o’s native multimodal image generation. Users didn’t get a choice. Overnight, “DALL-E” inside ChatGPT became a legacy label for something that was no longer DALL-E.
April 23, 2025 - OpenAI released gpt-image-1 through the API. This was the 4o image generation capability packaged as a standalone API model. DALL-E 3 API users were now officially on notice.
December 16, 2025 - GPT Image 1.5 rolled out, branded as “The new ChatGPT Images.” It brought more precise editing, face and logo preservation, and better instruction following. OpenAI also confirmed the DALL-E deprecation timeline.
April 21, 2026 - ChatGPT Images 2.0 launched, powered by the gpt-image-2 model. This model immediately claimed the #1 spot on every Image Arena leaderboard, including LM Arena and Artificial Analysis, with a record-breaking +242 point lead and a 93% win rate. It was a clean sweep.
May 12, 2026 - DALL-E 2 and DALL-E 3 API endpoints shut down permanently. Anyone still calling those endpoints got errors.
So as of today - May 18, 2026 - there is no more DALL-E. The name lives on in Wikipedia entries and old YouTube tutorials. What you actually use now is GPT Image 2 inside ChatGPT (branded as ChatGPT Images 2.0) or the gpt-image-2 model via the API.
What GPT Image 2 Actually Does
I’ve spent the past month putting GPT Image 2 through its paces, and I’ll say this plainly: it is not a minor update. OpenAI calls it a “state-of-the-art image generation model,” and for once, the marketing isn’t exaggerating much.
Text Rendering Is Finally Solved (Mostly)
The single biggest practical upgrade in GPT Image 2 is text rendering. Previous models - including DALL-E 3, GPT Image 1, and honestly every AI image generator until now - struggled with text. You’d get garbled characters, nonsense words, or letters that looked vaguely correct but were actually gibberish.
GPT Image 2 claims 99% text accuracy, and in my testing across English prompts, that number holds up. More importantly, it handles multilingual text: Japanese, Korean, Chinese, Hindi, and Bengali all render cleanly. I generated a Hindi poster with Devanagari script, and every character was correct. That was not possible with any previous OpenAI image model.
VentureBeat tested it with manga-style pages containing Japanese text bubbles of significant density, and the output was publishable-quality. WIRED’s review confirmed the improvement but noted some remaining struggles with non-English languages at very small font sizes. I saw the same - small Chinese characters in a diagram sometimes softened at the edges - but for most practical use, the text rendering is finally reliable enough for production work.
Thinking Mode and Multi-Image Output
The headline feature OpenAI demoed at launch is “Thinking Mode.” When enabled, GPT Image 2 doesn’t just generate pixels - it plans the layout, structures the composition, and reasons about spatial relationships before rendering. This lets it handle prompts with over 1,000 words of instruction.
The practical result: you can ask for an entire study booklet with multiple pages, consistent characters, and labeled diagrams, and it’ll produce the whole thing in one go. You can ask for a set of 10 infographic slides with consistent branding. You can ask for a comic strip with panels, speech bubbles, and a coherent narrative.
For business users, this is transformative. I generated a complete 6-slide product pitch deck - titles, bullet points, charts, consistent color palette - from a single long prompt. It took about 45 seconds in Thinking Mode and the output was genuinely usable with minor cleanup.
Resolution and Quality
GPT Image 2 outputs at native 2K resolution (2048px on the long edge) by default. Through the API, 4K is available in beta - up to 3840×2160 (roughly 8.3 megapixels). OpenAI flags anything above 2560×1440 as experimental, and in my testing, 4K outputs sometimes showed subtle artifacts at full zoom. But 2K is the new floor, and it’s a meaningful jump from the 1024×1024 standard that DALL-E 3 shipped with.
Photorealism has improved significantly. Side-by-side comparisons with GPT Image 1.5 show better skin texture, more natural lighting, and fewer anatomical errors. The Reddit community (r/singularity, r/ChatGPT) broadly agrees that GPT Image 2 “is the new standard in photorealistic image generation,” though some users note it can produce an overly smooth, “AI-looking” finish in certain lighting conditions.
Context Awareness and Iterative Editing
Because GPT Image 2 lives inside ChatGPT, it remembers what you generated earlier in the conversation. If you generate a product mockup and then ask “make the background darker and add our logo in the top right,” it understands what you’re referring to and edits the existing image rather than generating something entirely new. This was possible with GPT Image 1.5 but is noticeably more reliable in the 2.0 version.
Competitor Landscape: Where GPT Image 2 Stands
OpenAI’s strongest competitors have not stood still. Here’s where things stand in May 2026.
Midjourney V7
Midjourney V7 has been the default model since June 2025, and it remains the benchmark for aesthetic quality. If you want an image that looks like a Frazetta painting, a moody cinematic still, or a hyper-stylized editorial photograph, Midjourney V7 still produces more compelling results than GPT Image 2. Its style range is wider, and its artistic “taste” - that hard-to-define quality that makes an AI image feel like intentional art rather than a generated asset - is unmatched.
Where Midjourney loses: text rendering is still inconsistent, it requires Discord-based prompting (which many business users find clunky), and there’s no native conversational editing workflow. One Reddit user put it well: “Midjourney wins for ‘make me a cool image that looks like a Frazetta painting.’ ChatGPT’s generator is better at ‘give me a 15’x15’ storage room with labeled shelves.’”
Pricing: Basic plan at roughly $0.04–$0.08 per image equivalent, various subscription tiers.
Ideogram 3.0
Ideogram 3.0 launched on March 26, 2025 (and was refined on May 1, 2025), and it held the text-in-image crown until GPT Image 2 arrived. It introduced Style References - upload up to 3 reference images and Ideogram matches the aesthetic - and its text rendering across typography-heavy designs remains excellent, rated at roughly 90-95% accuracy by independent testers.
Ideogram is still the better choice if your primary use case is graphic design with heavy typography - logos, branding mockups, poster text - and you want a dedicated tool rather than working inside ChatGPT. Its free tier is also more generous than ChatGPT’s for image generation specifically.
Adobe Firefly
Adobe Firefly’s biggest differentiator in 2026 is commercial safety. Trained on Adobe Stock images and public domain content, Firefly outputs are indemnified for commercial use, which makes it the default choice for brand teams, agencies, and enterprises with legal departments. Its text rendering is better than Midjourney’s but behind Ideogram and GPT Image 2, and its aesthetic quality is solidly “good but rarely stunning.”
Firefly shines when integrated into the Adobe ecosystem - Photoshop Generative Fill, Illustrator vector recolor, Express templates - and for teams that need revision tracking, approval workflows, and asset management inside Creative Cloud.
Google Nano Banana / Imagen 4
Google’s Imagen 4 (branded informally as Nano Banana 2 in the latest iteration) was the leader before GPT Image 2 dethroned it. It still produces competitive results, especially for photorealistic product shots, and its per-image pricing ($0.02–$0.06) undercuts OpenAI’s API costs. If you’re building an application with high-volume image generation, Google’s pricing advantage is real.
Pricing: What You Actually Pay in 2026
OpenAI’s image generation pricing has gotten more complex since the simple “DALL-E 3 costs $0.04 per image” era. Here’s the breakdown as of May 2026:
ChatGPT Plans:
- Free: Limited access to ChatGPT Images 2.0 with usage caps. Community reports suggest roughly 2 images per day for free users.
- Plus ($20/month): Expanded access with generation limits that scale with overall ChatGPT usage. Most casual-to-moderate creators can work within these limits.
- Pro ($200/month): Significantly higher generation limits, virtually unlimited for most individual users. The tier for heavy creators.
- Team/Enterprise: Custom limits, shared workspaces, admin controls.
API Pricing (gpt-image-2):
- Input tokens: $8.00 per 1M tokens
- Cached input tokens: $2.00 per 1M tokens
- Per-image costs vary by quality and resolution, roughly $0.04–$0.35
- For comparison, DALL-E 3 was a flat $0.04–$0.12 per image
API Pricing (gpt-image-1.5, still available):
- Approximately $0.04 per standard image, competitive with Google Imagen 4
The token-based pricing model means complex prompts with long text descriptions cost more to process. But the per-image output cost is the dominant factor for most use cases, and GPT Image 2’s output quality generally justifies the premium over GPT Image 1.5 for production work.
What GPT Image 2 Still Can’t Do
I want to be direct about the limitations, because the launch hype was substantial.
Aesthetic style range remains narrower than Midjourney. GPT Image 2 can approximate most styles you describe, but the results often feel like “ChatGPT’s version of that style” rather than a convincing execution. For creative professionals working in distinctive visual styles, Midjourney V7 is still the better tool.
Generated text still needs proofreading. 99% accuracy is impressive, but it means 1 in 100 characters is wrong. On a dense infographic with 500 characters of text, that’s 5 errors. Always proofread. Never ship AI-generated text in images without human review.
WIRED’s testing flagged remaining struggles with very small non-English text. If you’re generating technical diagrams with fine-print labels in Chinese or Arabic, verify every character.
The free tier is mostly a demo. If you want to do real work with GPT Image 2, you need at least ChatGPT Plus at $20/month. This isn’t unreasonable - Midjourney and Ideogram both require paid subscriptions for serious use - but it’s a friction point compared to the old Bing Image Creator days when DALL-E 3 was effectively free.
Content policy restrictions are real. OpenAI blocks generations that could produce CSAM, sexual deepfakes, or content violating IP. The filters are sometimes overbroad - community forums contain regular complaints about innocuous prompts being rejected - and there’s no appeal mechanism inside ChatGPT.
Migration: If You Still Use DALL-E 3
If you’re reading this and still have code calling the DALL-E 3 API endpoints, those calls have been failing since May 12, 2026. You need to migrate to gpt-image-1.5 or gpt-image-2.
The simplest migration path is swapping your model parameter from dall-e-3 to gpt-image-1.5. The API structure is similar enough that basic generation calls usually work with minimal changes. For gpt-image-2, the quality jump is significant, but you’ll need to adjust your pricing expectations - it costs more per image, and the token-based billing means prompt length affects cost.
OpenAI’s deprecation docs and migration guide are linked in the Verified Sources section below. Microsoft’s Azure OpenAI service retired DALL-E 3 even earlier - February 18, 2026 - and now offers GPT Image 1.5 as the replacement. Bing Image Creator has already shifted to MAI-Image-2, Microsoft’s own second-generation model.
Bottom Line: What I Actually Recommend
The AI image generation landscape in May 2026 is genuinely competitive, and the right tool depends on what you’re building:
Use ChatGPT Images 2.0 (GPT Image 2) if: You need text-heavy images, infographics, business presentations, multi-page documents, or quick concept mockups inside a conversational workflow. It’s the best all-around tool for practical, production-oriented image generation.
Use Midjourney V7 if: You prioritize artistic quality, aesthetic distinctiveness, cinematic style, or any creative work where visual “taste” matters more than text accuracy. It’s still the king of beautiful images.
Use Ideogram 3.0 if: Your primary output is graphic design with heavy typography - logos, posters, branding mockups - and you want a dedicated tool rather than ChatGPT. Its free tier is more practical than ChatGPT’s.
Use Adobe Firefly if: Commercial safety, legal indemnification, and Creative Cloud integration are non-negotiable requirements. This is the enterprise-safe choice.
Use Google Imagen 4 if: You need high-volume, cost-sensitive generation via API, and you’re optimizing for per-image cost rather than maximum quality.
For my own workflow in 2026, ChatGPT Images 2.0 has become the default starting point. I generate concepts there, refine them conversationally, and if I need artistic polish beyond what it can deliver, I take the concept to Midjourney for the final render. The tools complement each other - they don’t replace each other.
DALL-E had a remarkable five-year run. It introduced generative AI imagery to the mainstream, pushed text-in-image capabilities forward, and set the standard that competitors had to beat. But DALL-E 3 is now a historical artifact - shut down, deprecated, and replaced. The future of OpenAI’s image generation is GPT Image, and the future arrived faster than most people expected.