LLM Reference

Gemini 1.0 Models by Google DeepMind

Google DeepMind | Proprietary
5 models | 2023–2024 | Up to 1m context | From $0.125/1M input tokens

Details

Researcher: Google DeepMind
License: Proprietary
Commercial use: Commercial use with conditions
Models: 5
Released: 2023–2024
Max context: 1m

Capabilities

Vision: 2 of 5 models
Multimodal: 1 of 5 models
Structured Outputs: 3 of 5 models

About

Google's Gemini 1.0 is a family of large language models (LLMs) that excels in capability and versatility 126. The Gemini family comprises three distinct models tailored to different tasks and hardware requirements: Gemini Ultra, aimed at tackling highly complex tasks; Gemini Pro, a flexible all-round model; and Gemini Nano, optimized for efficiency and on-device usage 16. These models are multimodal, capable of processing diverse data types such as text, code, audio, images, and video 1411. Known for achieving state-of-the-art performance, Gemini 1.0 models have been rigorously tested, at times surpassing human expert performance 14. Focused on safety and responsibility, these models undergo comprehensive safety evaluations before deployment 1.

Current Variants

Use-when guidance is derived from seed capabilities, context, release, and replacement fields.

Gemini 1.0 Pro Vision
Use when the workload needs 12k context, structured outputs, and multimodal inputs.
2024-04 | 12k context | structured outputs | multimodal inputs

Gemini 1.0 Ultra
Use when the workload needs 1m context.
2023-12 | 1m context

Gemini 1.0 Pro
Use when the workload needs 32k context and structured outputs.
2023-12 | 32k context | structured outputs

Gemini 1.0 Nano
Use when the workload needs 32k context.
2023-12 | 32k context

Gemini 1.0 Pro on Google Vertex AI
Use when the workload needs 33k context, structured outputs, and multimodal inputs.
2023-12 | 33k context | structured outputs | multimodal inputs

Release Timeline

2 release groups

2024-04 (1 current)
Gemini 1.0 Pro Vision | 12k context, structured outputs, multimodal inputs | Current

2023-12 (4 current)
Gemini 1.0 Nano | 32k context | Current
Gemini 1.0 Pro | 32k context, structured outputs | Current
Gemini 1.0 Pro on Google Vertex AI | 33k context, structured outputs, multimodal inputs | Current
Gemini 1.0 Ultra | 1m context | Current

Specifications (5 models)

Gemini 1.0 model specifications comparison
Model                              | Released | Context | Vision | Multimodal | Structured Outputs
Gemini 1.0 Pro Vision              | 2024-04  | 12k     | Yes    | No         | Yes
Gemini 1.0 Ultra                   | 2023-12  | 1m      | No     | No         | No
Gemini 1.0 Pro                     | 2023-12  | 32k     | No     | No         | Yes
Gemini 1.0 Nano                    | 2023-12  | 32k     | No     | No         | No
Gemini 1.0 Pro on Google Vertex AI | 2023-12  | 33k     | Yes    | Yes        | Yes
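
The specification table can be turned into a small lookup for shortlisting models by requirement. The following is a minimal sketch, not an official tool: the `ModelSpec` class and `matching_models` function are hypothetical names, and the token counts are approximations that read "12k" as 12,000 and "1m" as 1,000,000.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelSpec:
    name: str
    released: str
    context_tokens: int  # assumption: "32k" read as 32_000, "1m" as 1_000_000
    vision: bool
    multimodal: bool
    structured_outputs: bool


# Rows copied from the specifications table on this page.
SPECS = [
    ModelSpec("Gemini 1.0 Pro Vision", "2024-04", 12_000, True, False, True),
    ModelSpec("Gemini 1.0 Ultra", "2023-12", 1_000_000, False, False, False),
    ModelSpec("Gemini 1.0 Pro", "2023-12", 32_000, False, False, True),
    ModelSpec("Gemini 1.0 Nano", "2023-12", 32_000, False, False, False),
    ModelSpec("Gemini 1.0 Pro on Google Vertex AI", "2023-12", 33_000, True, True, True),
]


def matching_models(min_context=0, vision=False, multimodal=False,
                    structured_outputs=False):
    """Return the names of models that meet every requested requirement."""
    return [
        m.name
        for m in SPECS
        if m.context_tokens >= min_context
        and (m.vision or not vision)
        and (m.multimodal or not multimodal)
        and (m.structured_outputs or not structured_outputs)
    ]


if __name__ == "__main__":
    # Models with at least 30k tokens of context and structured outputs.
    print(matching_models(min_context=30_000, structured_outputs=True))
```

A flag left at `False` means "not required", so calling `matching_models()` with no arguments returns all five models in table order.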

Available From (1 provider)

Pricing

Gemini 1.0 model pricing by provider
Model                              | Provider      | Input / 1M tokens | Output / 1M tokens | Type
Gemini 1.0 Pro on Google Vertex AI | GCP Vertex AI | $0.125            | $0.375             | Serverless
Gemini 1.0 Pro                     | GCP Vertex AI | $0.50             | $1.50              | Serverless
Gemini 1.0 Pro Vision              | GCP Vertex AI | $0.50             | $1.50              | Serverless
Gemini 1.0 Ultra                   | GCP Vertex AI | $1.00             | $3.00              | Serverless
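
To compare these prices for a concrete workload, the per-1M-token rates can be applied to expected token counts. The sketch below assumes the listed figures are USD per one million tokens and that billing is linear in token count. `PRICES_PER_1M` and `estimate_cost` are hypothetical names used for illustration, not part of any provider SDK.

```python
# (input, output) price per 1M tokens, copied from the pricing table on this page.
# Assumption: prices are in USD and scale linearly with token count.
PRICES_PER_1M = {
    "Gemini 1.0 Pro on Google Vertex AI": (0.125, 0.375),
    "Gemini 1.0 Pro": (0.50, 1.50),
    "Gemini 1.0 Pro Vision": (0.50, 1.50),
    "Gemini 1.0 Ultra": (1.00, 3.00),
}


def estimate_cost(model, input_tokens, output_tokens):
    """Estimate the cost of a workload from its input and output token counts."""
    input_price, output_price = PRICES_PER_1M[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Example workload: 2M input tokens and 500k output tokens.
    for name in PRICES_PER_1M:
        print(f"{name}: ${estimate_cost(name, 2_000_000, 500_000):.4f}")
```

For that example workload, Gemini 1.0 Pro comes to 2 × $0.50 + 0.5 × $1.50 = $1.75, while Gemini 1.0 Pro on Google Vertex AI comes to 2 × $0.125 + 0.5 × $0.375 = $0.4375. Gemini 1.0 Nano is omitted because the table lists no price for it.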

Frequently Asked Questions

What is Gemini 1.0 used for?
Gemini 1.0 is used for vision and multimodal work, structured outputs, and coding. The family description and listed model capabilities point to those workloads as the best fit.
How does Gemini 1.0 compare to Gemma 4?
Gemini 1.0 by Google DeepMind is strongest for vision and multimodal work, while Gemma 4, also by Google DeepMind, is the closest related family to consider for multimodal workloads. Gemini 1.0 has 5 listed variants and reaches up to 1m context, while Gemma 4 reaches up to 256k context, so compare the specs and pricing tables before choosing a production model.
Which Gemini 1.0 model should I use?
For the lowest listed input price, start with Gemini 1.0 Pro on Google Vertex AI through GCP Vertex AI at $0.125/1M input tokens. For the broadest listed capability set, evaluate the same model, which offers 33k context, structured outputs, and multimodal inputs.