Other Image Models on Vertex AI | Generated by AI

Home 2025.08

While Imagen is the flagship family of image generation models from Google on Vertex AI, there are other options available, particularly if you’re looking for different capabilities or open-source alternatives.

Here’s a breakdown of other image-related models you can use on Vertex AI:

Gemini Models (for multimodal use cases)

The Gemini family of models are multimodal, meaning they can process and understand multiple types of data, including text, images, video, and audio. While they are primarily known for their text-based outputs, some Gemini models have specific features for image-related tasks. For example, some Gemini models can:

This is a great option if you’re building an application that needs to handle both text and images in a conversational or integrated way, rather than just generating an image from a simple prompt.

Open-Source Models from Model Garden

Vertex AI’s Model Garden is a centralized hub where you can discover, test, customize, and deploy a wide range of models, including both Google’s proprietary models and a curated selection of open-source models. This is where you’ll find alternatives to Imagen, such as:

These open-source models often come with different licensing terms, performance characteristics, and customization options, giving you more flexibility for specific use cases.

Beyond image generation, Vertex AI offers pre-trained models for a variety of vision tasks, which you might find useful:


Back Donate