LLM ReferenceLLM Reference
Replicate API

Dolly 2.0 12B on Replicate API

Dolly 2.0 · Databricks Mosaic

Serverless

Last refreshed 2026-04-15. Next refresh: weekly.

Why use Dolly 2.0 12B on Replicate API?

Replicate API offers Dolly 2.0 12B with pay-as-you-go pricing at $0.10/1M input tokens. Replicate is a cloud-based platform that enables users to run machine learning models easily and efficiently.

Compare Dolly 2.0 12B across 2 providers to find the best fit for your use case
Input / 1M
$0.10
Output / 1M
$0.50
Cache
Not sourced
Batch
Not sourced

Setup recipe

Python + curl
Install
pip install replicate
Auth
export REPLICATE_API_TOKEN=...
Call
import replicate
output = replicate.run(
    "dolly-2.0-12b",
    input={"prompt": "Hello"}
Model ID
dolly-2.0-12b

Request example

import replicate

# reads REPLICATE_API_TOKEN from env
# dolly-2.0-12b format: "owner/model-name" (latest version) or "owner/model-name:version-hash"
output = replicate.run(
    "dolly-2.0-12b",
    input={"prompt": "Hello"}
)
# Output is a list or generator depending on the model
print("".join(output))

Gotchas

  • Replicate uses "owner/model-name" format (e.g. "meta/meta-llama-3-8b-instruct") for the latest version, or "owner/model-name:version-sha" to pin to a specific version. The REST endpoint splits owner and model-name into the path: /v1/models/{owner}/{model-name}/predictions.
  • The examples expect REPLICATE_API_TOKEN; rename it only if your application config maps the new variable.

Compare Dolly 2.0 12B Across Providers

ProviderInput (per 1M)Output (per 1M)
Microsoft Foundry$0.07$0.07
Replicate API$0.10$0.50

Pricing

TypePrice (per 1M)
Input tokens$0.10
Output tokens$0.50

Capabilities

No model capability flags are currently sourced.

About Dolly 2.0 12B

Dolly 2.0 12B is an instruction-following large language model developed by Databricks. It is built on the Pythia-12b architecture with 12 billion parameters and trained on around 15,000 instruction-response pairs crafted by Databricks employees. This model is adept at handling tasks such as brainstorming, classification, open and closed question answering, text generation, information extraction, and summarization. Despite its versatility, Dolly 2.0 12B is not considered state-of-the-art and struggles with more complex prompts, programming, mathematical tasks, factual accuracy, and nuanced tasks like humor. It is notable for its open-source license, permitting commercial use, though it may carry biases from its training data.

FAQ

What does Dolly 2.0 12B cost on Replicate API?

On Replicate API, Dolly 2.0 12B costs $0.10 per 1M input tokens and $0.50 per 1M output tokens.

How does Replicate API compare to other Dolly 2.0 12B providers?

Dolly 2.0 12B is available from 2 providers. The cheapest input pricing is $0.07/1M tokens from Microsoft Foundry.

Who created Dolly 2.0 12B?

Dolly 2.0 12B was created by Databricks Mosaic as part of the Dolly 2.0 model family.

Is Dolly 2.0 12B open source?

Dolly 2.0 12B's open source status is unknown in the seed data.

Get Started

Model Specs

Released2023-04-12
Parameters12b
ArchitectureDecoder Only