LLM Reference
Mistral

Mistral

MistralAIOpen SourceHighlight

About

Mistral AI, a Paris-based startup, has created a remarkable family of large language models (LLMs) known for their efficiency and high performance. Their flagship model, Mistral 7B, features 7.3 billion parameters and delivers superior performance compared to other models of a similar size, challenging even larger models in specific benchmarks 245. The model employs cutting-edge attention mechanisms, such as Grouped-query Attention (GQA) and Sliding Window Attention (SWA), to ensure faster inference and efficiently manage longer sequences 478. Licensed under Apache 2.0, Mistral 7B is accessible for community use and contributions 245. Additionally, Mistral AI provides instruction-tuned versions like Mistral 7B Instruct, designed for chat and question-answering tasks 57. The company is continuously expanding its model lineup, including newer models like Pixtral, which features multimodal capabilities 13. Mistral's relentless pursuit of efficiency and performance makes its LLMs significant assets for researchers and developers alike 6910.

Models(14)

Details

ResearcherMistralAI
Models14