LLM Reference

Orca 13B

About

Orca 13B is a 13-billion-parameter language model developed by Microsoft. It is built on a fine-tuned LLaMA-2 base model and is trained to imitate the reasoning processes of larger models such as GPT-4. Because it is significantly smaller than GPT-4, it requires fewer computational resources while aiming for comparable performance. Its reasoning abilities are developed with a synthetic training dataset that was moderated by Microsoft Azure content filters. Its capabilities include reasoning, reading comprehension, math problem-solving, and text summarization, though it is not optimized for chat tasks. Orca 13B uses a progressive learning approach, learning from GPT-4's explanation traces to improve accuracy and contextual understanding. However, it may still exhibit biases, hallucinate, and misinterpret scenarios. Quantized versions are available that offer different trade-offs among speed, accuracy, and memory usage.
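To make the memory side of the quantization trade-off concrete, here is a rough back-of-the-envelope sketch. It assumes the 13B parameter count listed under Specifications and estimates the size of the weights alone. The bit-widths are illustrative assumptions and do not correspond to any specific published quantized release. Real files also carry overhead such as quantization scales, and inference needs extra memory for activations and the KV cache.

```python
# Rough estimate of weight storage for a 13B-parameter model.
# Assumption: every weight is stored at the same bit-width, with no
# per-block scales or metadata (real quantized formats add some overhead).

PARAMS = 13_000_000_000  # 13B, from the Specifications section


def weight_memory_gib(num_params: int, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in GiB."""
    total_bytes = num_params * bits_per_weight / 8
    return total_bytes / 2**30


if __name__ == "__main__":
    # Illustrative bit-widths only; not a list of actual releases.
    for label, bits in [("16-bit", 16), ("8-bit", 8), ("4-bit", 4)]:
        print(f"{label}: ~{weight_memory_gib(PARAMS, bits):.1f} GiB")
```

Halving the bit-width halves the weight footprint, which is why lower-bit versions fit on smaller GPUs. The accompanying loss in accuracy depends on the quantization method and is not captured by this arithmetic.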

Capabilities

Multimodal
Function Calling
Tool Use
JSON Mode

Specifications

Family: Orca
Released: 2023-06-05
Parameters: 13B
Architecture: Decoder Only
Specialization: general