LLM Reference
InternLM-XComposer

InternLM-XComposer

About

The InternLM-XComposer family features a suite of advanced large vision-language models (LVLMs) tailored for sophisticated text-image understanding and composition. These models showcase exceptional capabilities in multimodal tasks, performing on par with GPT-4V while utilizing a more compact 7B parameter language model backend. Prominent functionalities include the interpretation of ultra-high resolution images, detailed video analysis, and handling complex multi-turn, multi-image dialogues. Additionally, they can convert text or image instructions into web pages and produce high-quality text-image articles. As open-source models, they are readily accessible for further exploration and innovation in the research community. The latest version, InternLM-XComposer-2.5, offers enhanced performance over its predecessors, particularly in managing longer context scenarios 24.

Models(2)

Details

ResearcherIntern-AI
Models2