Google Launches Whisk: A New AI Tool That Remixes Images Using Visual Prompts

Google launches Whisk, an experimental AI tool that generates new images by combining elements from multiple source pictures rather than text prompts.

Al Landes Avatar
Al Landes Avatar

By

Our editorial process is built on human expertise, ensuring that every article is reliable and trustworthy. AI helps us shape our content to be as accurate and engaging as possible.
Learn more about our commitment to integrity in our Code of Ethics.

Image Credit: Whisk

Key Takeaways

  • Revolutionary visual-first approach allows users to remix images by combining subjects, scenes, and styles from source photos
  • Tool leverages Google’s Gemini and Imagen 3 AI systems to interpret and generate new imagery
  • Currently available exclusively to US users through Google Labs as an experimental creative tool

Why it matters: Google has introduced Whisk, an experimental AI image generator that takes a unique visual-first approach to creating new images. Engadget reports that unlike traditional text-based AI art tools, Whisk allows users to combine elements from multiple source images, potentially transforming how creators brainstorm and visualize concepts.

The Big Picture: The Verge reports that the tool works by combining three key elements:

  • Subject image (like a person or object)
  • Scene image (background or setting)
  • Style image (artistic direction)

Technical Innovation: Whisk leverages two powerful AI systems working in tandem:

  • Gemini generates detailed captions of uploaded images
  • Imagen 3 creates new images based on these descriptions
  • Results maintain key characteristics while allowing creative variation

Creative Applications: Google positions Whisk as a rapid ideation tool rather than a precision editor. It won’t replace the best monitor for video editing for example. Users can:

  • Create digital stickers and enamel pin designs
  • Generate plushie concepts
  • Explore quick visual variations of ideas

Looking Forward: While currently limited to US users through Google Labs, Whisk represents a significant shift in AI image generation by prioritizing visual rather than textual inputs. However, the tool’s focus on rapid exploration rather than production-ready outputs suggests it’s meant to complement rather than replace existing creative tools.

Share this

At Gadget Review, our guides, reviews, and news are driven by thorough human expertise and use our Trust Rating system and the True Score. AI assists in refining our editorial process, ensuring that every article is engaging, clear and succinct. See how we write our content here →