Google’s native multimodal AI image generation in Gemini 2.0 Flash impresses with fast edits, style transfers
1 min read
Summary
Google has added multi-modal imaging to its AI chatbot, Gemini 2.0 Flash, so giving it the ability to create images as well as text in response to prompts from users.
This is the first time a US tech company has integrated such wide imaging generation within a model that can be used by consumers.
Gemini 2.0 Flash allows developers to create images from text prompts, and then edit the images using the same method.
It also enables conversational image editing, so users can refine images using prompts.
The new tool offers potential for creative design, enterprise tools, and AI-assisted visual storytelling.