InternLM XComposer2 VL 7B
About
InternLM-XComposer2-VL-7B is a vision-language large model (VLLM) built on the InternLM2 architecture and designed for robust text-image comprehension and composition. It uses Partial LoRA (P-LoRA) to align the embedding space of a pre-trained Vision Transformer (ViT) with that of the language model: the low-rank adaptation weights are applied only to image tokens, preserving the language model's text abilities while enhancing multimodal understanding.

The model is pretrained on datasets such as COCO and TextCaps to refine general semantics and strengthen its visual capabilities, then supervised fine-tuned on a range of vision-language tasks. It excels at image captioning, visual question answering, and creative text-image composition, and can handle high-resolution images and fine-grained detail.

The InternLM-XComposer2-VL-7B family includes a 4-bit quantized variant for reduced VRAM usage, along with further variants for high-resolution understanding and long-context inputs.
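The P-LoRA mechanism described above can be sketched in a few lines of NumPy. This is a minimal illustration under assumed shapes and names (`plora_forward`, the toy dimensions, and the mask layout are all hypothetical), not the model's actual implementation: the frozen base projection is applied to every token, while the low-rank update is added only at positions flagged as image tokens.

```python
import numpy as np

rng = np.random.default_rng(0)
d_model, rank, seq_len = 8, 2, 5

# Frozen base projection weight of a linear layer (stands in for a
# pre-trained LLM weight; shapes are toy-sized for illustration).
W = rng.normal(size=(d_model, d_model))

# Low-rank LoRA factors (trainable in practice). B starts at zero,
# so the low-rank update is initially a no-op, as in standard LoRA.
A = rng.normal(size=(d_model, rank))
B = np.zeros((rank, d_model))

def plora_forward(x, image_mask):
    """Apply the base projection to all tokens, and add the
    low-rank update only where image_mask is True (Partial LoRA)."""
    base = x @ W              # frozen path, all tokens
    lora = (x @ A) @ B        # low-rank path
    return base + image_mask[:, None] * lora  # update image tokens only

x = rng.normal(size=(seq_len, d_model))
image_mask = np.array([False, True, True, False, False])  # tokens 1-2 are image tokens

out = plora_forward(x, image_mask)
# With B initialized to zero, the output equals the plain projection.
assert np.allclose(out, x @ W)
```

The key design point is the mask: text tokens always take the frozen pre-trained path, so adapting the model to visual inputs does not perturb its language behavior.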