Google quietly rolled out a powerful new version of Gemini last week that lets anyone edit photos using plain English commands instead of technical skills. The experimental version of Gemini 2.0 Flash with native image generation capabilities is now available to all users after being limited to testers only since last year.
Unlike most current AI image tools, this isn’t just about generating new images from scratch. Google has created a system that understands existing photos well enough to modify them through natural conversation, maintaining much of the original content while making specific changes.
This is possible because Gemini 2.0 is natively multimodal, meaning it can understand both text and images simultaneously. The model converts images into tokens—the same basic units it uses to process text—allowing it to manipulate visual content using the same neural pathways it uses to understand language. This unified approach means the system doesn’t need to call separate specialized models to handle different media types.
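As a rough illustration of what "converting images into tokens" can mean, the toy Python sketch below places text tokens and image tokens in one shared sequence. Everything in it is an assumption made for illustration: the tiny vocabulary, the brightness-based patch codes, and the function names are hypothetical, and nothing here is taken from Google's description of how Gemini actually tokenizes images.

```python
from typing import List

# Hypothetical toy text vocabulary (an assumption, not Gemini's).
TEXT_VOCAB = {"<pad>": 0, "make": 1, "the": 2, "sky": 3, "purple": 4}

# Image token ids are placed after the text ids so both share one id space.
IMAGE_TOKEN_OFFSET = len(TEXT_VOCAB)
NUM_IMAGE_CODES = 4  # toy "codebook" size, chosen arbitrarily


def text_to_tokens(text: str) -> List[int]:
    """Map each whitespace-separated word to its id in the toy vocabulary."""
    return [TEXT_VOCAB[word] for word in text.lower().split()]


def image_to_tokens(pixels: List[List[int]], patch: int = 2) -> List[int]:
    """Turn a grayscale image (values 0-255) into one token per patch.

    Each patch is reduced to its mean brightness and bucketed into one of
    NUM_IMAGE_CODES codes. Assumes the image height and width are both
    multiples of `patch`.
    """
    tokens = []
    for row in range(0, len(pixels), patch):
        for col in range(0, len(pixels[0]), patch):
            block = [
                pixels[i][j]
                for i in range(row, row + patch)
                for j in range(col, col + patch)
            ]
            mean = sum(block) / len(block)
            code = min(int(mean * NUM_IMAGE_CODES / 256), NUM_IMAGE_CODES - 1)
            tokens.append(IMAGE_TOKEN_OFFSET + code)
    return tokens


def build_sequence(prompt: str, pixels: List[List[int]]) -> List[int]:
    """Concatenate text and image tokens into a single sequence."""
    return text_to_tokens(prompt) + image_to_tokens(pixels)


# A 4x4 grayscale image, split into four 2x2 patches.
image = [
    [0, 0, 255, 255],
    [0, 0, 255, 255],
    [100, 100, 200, 200],
    [100, 100, 200, 200],
]
sequence = build_sequence("make the sky purple", image)
print(sequence)
```

The point of the sketch is only that, once both media types live in the same sequence of integer ids, a single model can read and write them together; a production system would rely on a learned image tokenizer rather than brightness buckets.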
“Gemini 2.0 Flash combines multimodal input, enhanced reasoning, and natural language understanding to create images,” Google said in the official announcement. “Use Gemini 2.0 Flash to tell a story and it will illustrate it with pictures, keeping the characters and settings consistent throughout. Give it feedback and the model will retell the story or cha…
