OpenAI released its latest image AI model, Image 2.0, this week. While the model may have gone slightly under the radar amid the plethora of other AI launches, it is still among the best AI photo editing models available, taking on the likes of Gemini, Seedream and Qwen.
I stress-tested ChatGPT Images 2 over the last few days with these 10 prompts. Here is how the model fared.
1) Real life billboard in Hindi:
Prompt: “A photorealistic roadside billboard in India on a busy highway, with clear and perfectly legible Hindi text that reads: ‘यह एक परीक्षण है – OpenAI इमेज मॉडल’, bold Devanagari font, natural lighting, realistic shadows, slight weathering on the billboard, cars and bikes passing by, shot on a DSLR, 50mm lens, high detail, no text distortion”
2) Time test:
"Three analog wall clocks side-by-side in a modern airport terminal. The left clock reads exactly 10:15, the middle clock reads exactly 02:45, and the right clock reads exactly 07:30. Below each clock is a digital LED sign showing the identical time as the analog clock above it."
3) Real life image:
"A photograph taken over someone's shoulder looking at a laptop screen in a dark room. The screen shows a code editor with perfectly legible Python code. The code must be structurally valid, properly indented, color-syntax highlighted, and contain zero gibberish characters or alien symbols."
4) Biker photoshoot:
“Transform this upload image into a rugged biker photoshoot. Keep facial identity unchanged. Add a leather jacket, motorcycle, highway background, dramatic lighting, dust particles in air, cinematic look, photorealistic”
5) Claude bhujia:
“A photorealistic macro shot of a single packet of “Claude Bhujia” (क्लॉड भुजिया) placed inside a cluttered Indian kirana store rack. The packet is made of metallic foil with heavy crinkles, sharp folds, and realistic reflections and glare.
At the top of the packet, include the official Claude AI logo: a clean white rectangular label with rounded corners, featuring the orange starburst icon to the left and the word “Claude” in a modern black sans-serif font. The logo must be accurately rendered, sharp, and proportionally correct, printed naturally onto the crinkled foil surface without distortion.
Below the logo, bold Devanagari text “क्लॉड भुजिया” should be perfectly legible, following the folds and curves of the packaging naturally.”
6) Realistic newspaper clipping:
"A candid photo of a person reading '(name of the paper)' newspaper. The reader is looking intently at the front page. The front page features a photo of the subject ([Your Description/Uploaded Image Context]) looking surprised. The large, realistic headline reads: 'MAN WHO ORDERED ONLY CHAI AT CAFÉ FOR SIX HOURS ARRESTED ON CHARGES OF "STALLING COMMERCIAL PROGRESS"'. Below it, smaller subtext reads: 'Staff claim he only said "Just one more"'. Realistically rendered columns of print are below. The paper has slight creases."
7) Viral TV clipping:
“Turn this upload image into a screenshot from an Indian news channel broadcast. Use this face as the main subject. Add ticker text: ‘Breaking: Man claims he can survive only on chai and Parle-G’. Include channel logo, scrolling ticker, studio lighting, ultra-realistic.”
8) The train test:
“Inside an Indian Railway station (e.g., CSMT) at night. A large, glowing red LED dot-matrix display board hangs from the roof. The board must display 'पंजाब मेल एक्सप्रेस' (Punjab Mail Express) in perfect Hindi text. The text must be rendered entirely out of individual, glowing red LED dots, not a smooth font.”
9) Indian society test:
"A lavish, towering residential society gate in NCR. A bewildered Blinkit delivery guy is sitting on his scooter at the barricade. The security guard is aggressively pointing to a massive, cheap flex banner tied to the iron gates. The banner clearly reads in bold Devanagari and English: "Zomato, Swiggy, Blinkit Allowed. RELATIVES NOT ALLOWED WITHOUT MYGATE APPROVAL." The banner must have realistic folds and zip-ties holding it to the gate."
10) Indian road shot:
A cinematic shot on a busy Delhi NCR highway. A standard green and yellow Indian Bajaj auto-rickshaw is driving fast. It is physically carrying exactly 15 people. People are hanging off the sides, squished next to the driver, and two guys are sitting comfortably on the canvas roof. A perfectly painted phrase on the back of the auto reads "Maa Ki Dua" in stylized Hindi. The rear wheels and suspension of the auto must look visibly crushed and lowered under the extreme physical weight.
Some thoughts on where the model falls short:
While OpenAI has made great strides with the speed and quality of Images 2, to the point where it feels better than Nano Banana on most tasks, one area where the model still seems to struggle is making precise edits to images, something at which Alibaba's Qwen excels. I recently gave ChatGPT a leaked image of dummy units of the iPhone 18 Pro series and the iPhone Ultra (Apple's foldable) and asked it to colour them in Apple's style. While the chatbot gave the iPhone 18 Pro and Pro Max perfect colours without changing their design, it completely changed the design of the iPhone Ultra and turned it into a regular candybar phone.
When I asked why, ChatGPT said: “I can’t literally copy-paste or surgically edit your exact uploaded image while preserving every pixel. The image tool I’m using regenerates the scene each time”.
Similarly, while generating these images, I noticed that ChatGPT maintained great facial consistency when asked to place a person in various scenarios, much better than Gemini, to be honest. However, once you ask ChatGPT to make edits two or three times within the same chat, it quickly starts to lose control, and you see it messing up the facial features every so often.

17 hours ago