MetaMath Mistral 7B
About
MetaMath Mistral 7B is a large language model specialized in mathematical reasoning and problem-solving, fine-tuned on the MetaMathQA dataset. Based on the Mistral-7B architecture, it employs features like sliding window attention and rolling buffer KV cache to enhance efficiency and reduce memory usage. It achieves a notable 77.7% pass@1 score on the GSM8K benchmark, surpassing previous models. Its capabilities make it suitable for educational tools, such as intelligent math assistants, and its open-source availability under the Apache 2.0 license offers flexibility for developers.
Capabilities
MultimodalFunction CallingTool UseJSON Mode