A mystery AI image editor named “nano banana” recently rose to the top of LMArena, the popular AI leaderboard. The model easily bested its competitors in the arena, which lets users test AI models head-to-head. Now, Google DeepMind has revealed that nano banana is actually the alias for Gemini 2.5 Flash Image.
Before the big reveal, Googlers did drop some hints on social media.
Now that the model has been officially rolled out, Google DeepMind said that Gemini will be better at editing your images. Products like this move us a step closer to a post-Photoshop world. Instead of requiring people to learn the technical ins and outs of photo editing software, which can take years to master, AI image editors will make it possible for anyone to edit an image with just a few simple voice or text prompts, at least in theory.
The Google DeepMind team says this model has been trained to make subjects more consistent across various edits of AI-generated images. This has been an issue for AI image models, given their unpredictable nature. I tried out the new “nano banana” model for myself, and it worked… fine.
The ability to upload and natively edit photos in Gemini has been around since April of this year. With Gemini’s updated model, Google says you can do things like change a subject’s outfit and location, while keeping their likeness the same.
You can also upload multiple photos and have the subjects appear together in the same image, or add and change specific details in an uploaded image to, say, see what a room looks like with a different color of paint or different furniture.
Here's Gemini’s attempt at editing my dog into the downward dog pose and relocating her to a yoga studio. Her likeness is the same, and it successfully edited the image to make her eyes open, but her body isn't arched in the way it should be. (I would know, I’ve seen this playful pose from her many times.)
Here's my dog Lola, not doing yoga.
Credit: Mashable
Here's the Gemini-edited version after I prompted it to open her eyes, put her in the downward dog pose, and change the background to a yoga studio. It's close, but not quite right.
Credit: Mashable
As Google DeepMind said in its announcement, the model might not always get it right. There might still be inaccuracies with fine details, text in the image, and inconsistencies. In my experiment, my dog’s fur looks overly smooth, but her overall coloring, size, and shape stay the same. All images have a visible watermark and an invisible watermark called SynthID to mitigate any confusion over whether they’re real or AI-generated. This update is now live, so you can try it out for yourself in the Gemini app.