Haotian Liu

10 models across 3 families · Latest: LLaVA 1.6 Vicuna 7B (2024-01)

Researched 60d ago

Academic researcher focused on vision models

Vision · Classification · JSON / Tool use · Individual

Haotian Liu's portfolio covers 8 active models across 3 current families, spanning vision, classification, and JSON / tool use; last verified 2026-05-19. Open a model detail page to compare provider routes and sourced benchmarks.

Use it for

Teams evaluating vision, classification, and JSON / tool use across this lab's releases
Comparing model families before committing to a flagship
Migration and pricing follow-ups across 8 tracked models

Do not use it for

Choosing a hosting provider without opening a model page for price ladders

Active models: 8 (current models from this lab, excluding deprecated ones)
Active families: 3 (current model families from this lab)
Open catalog: 8 open (2 open source / 6 open weights)
Lowest output price: $0.150 per 1M tokens (cheapest tracked output across active models)
Latest dated release: 2024-01-31 (LLaVA 1.6 Vicuna 7B)
Freshness: 2026-05-19 (aging)

Information

Founded: N/A


Release cadence

Showing 5 recent dated releases (full timeline below). Latest: LLaVA 1.6 Vicuna 7B (2024-01-31).

Where this lab wins

Vision: 3 tracked models with multimodal benchmark coverage.
Classification: 1 tracked model with MMLU-class moderation/safety coverage.
JSON/tool-use: 1 tracked model with BFCL / Nexus strict-JSON routing coverage.

Flagship quality / price signal

Flagship: LLaVA Vicuna 13B, listed as the best sourced coding quality-per-dollar in this portfolio.

A quality-per-dollar figure is not available for this flagship, because benchmark coverage or output token pricing is still missing.

Haotian Liu is an individual academic researcher focused on vision models; no founding date is listed. The portfolio comprises 3 model families totaling 10 models, and the most recent release is LLaVA 1.6 Vicuna 7B (2024-01). Notable families include LLaVA 1.6, LLaVA 1.5, and LLaVA. Use this page as a stable reference for lab background, release coverage, and follow-up model pages. Each Haotian Liu model page lists official API endpoints, benchmark performance, and coding/agent fit.

About

Haotian Liu is an AI researcher working on generative AI and large language models (LLMs), with a focus on multimodal learning. He trained in computer vision and machine learning, earning a bachelor's degree at Zhejiang University and a Ph.D. at the University of Wisconsin-Madison under Professor Yong Jae Lee, where his research centered on integrating visual and textual data.

A central part of Liu's research is the development of the Large Language and Vision Assistant (LLaVA). LLaVA applies visual instruction tuning to an LLM so that it can interpret images and generate responses grounded in visual input. The stated aim is for such systems to approach the comprehension level of models like GPT-4, supporting interactions that combine linguistic and visual elements. Improved contextual understanding and reasoning over images matters for applications in sectors such as biomedical research and education.

Liu also contributed to LLaVA-Med, a variant of LLaVA designed for biomedical use. The model draws on large-scale datasets derived from PubMed Central, which improves its ability to answer detailed questions about biomedical images. It uses a curriculum learning strategy in which the model progresses from simpler tasks to more complex ones, loosely mirroring how people learn in the biomedical sciences. The approach reflects an emphasis on accessible, domain-specific AI tools for professionals.

Beyond these projects, Liu's research includes papers on applying generative AI to practical problems, particularly combining multiple modalities for tasks such as visual question answering and image captioning. This work aims to bridge language and image understanding so that AI systems respond precisely, and with relevant context, to complex human queries.

Across his work, Liu emphasizes multimodal learning and the personalization of AI systems to user needs, with the goal of supporting professionals in domains that require detailed domain knowledge. His research is influential in both academic and applied settings.

Featured models

Model	Released	Context	Input price ($/1M)	Output price ($/1M)	License	Openness
LLaVA 1.6 Vicuna 7B	2024-01-31	4k	$0.05	$0.25	Apache 2.0	Open source
LLaVA 1.6 Vicuna 13B	2024-01-31	4k	$0.10	$0.50	Apache 2.0	Open source
LLaVA 1.6 Mistral 7B	2024-01-31	32k	$0.05	$0.25	Apache 2.0	Open source
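
The per-1M-token prices in the table can be turned into a rough per-request estimate. The following is a minimal sketch, assuming cost scales linearly with token counts and that the listed prices apply as shown; the `estimate_cost` helper and the token counts in the usage note are hypothetical and are not part of any provider's API.

```python
# Prices copied from the Featured models table above, in USD per 1M tokens:
# (input price, output price).
PRICES_PER_MILLION = {
    "LLaVA 1.6 Vicuna 7B": (0.05, 0.25),
    "LLaVA 1.6 Vicuna 13B": (0.10, 0.50),
    "LLaVA 1.6 Mistral 7B": (0.05, 0.25),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return an estimated request cost in USD (hypothetical helper).

    Assumes simple linear pricing: tokens * price-per-1M / 1,000,000.
    """
    input_price, output_price = PRICES_PER_MILLION[model]
    total = input_tokens * input_price + output_tokens * output_price
    return total / 1_000_000


if __name__ == "__main__":
    # Illustrative token counts only, not measured usage.
    cost = estimate_cost("LLaVA 1.6 Vicuna 13B", input_tokens=2_000, output_tokens=500)
    print(f"Estimated cost: ${cost:.6f}")
```

Actual charges depend on the provider route, so treat the result as an order-of-magnitude check before opening a model page for exact price ladders.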