LLM Reference

RedPajama INCITE 3B

About

RedPajama-INCITE 3B is an open-source large language model from Together Computer with 2.8 billion parameters. It uses the Pythia architecture, a GPT-3-style decoder-only transformer, with fully dense attention layers, FlashAttention, rotary positional embeddings, parallelized attention and feed-forward computation, and untied embedding matrices. It is intended for natural language processing tasks such as question answering, text generation, summarization, and conversational AI. Its training data comes from a diverse 1.2 trillion token dataset that is primarily English with content in 20 other languages. Its relatively small size makes it easier to access and deploy, but it is less capable than larger models and should be used responsibly.
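To illustrate one of the architectural features named above, here is a minimal sketch of rotary positional embeddings in plain Python. It is not the model's own code: the function names `rotate_pairs` and `dot`, the base of 10000, and the interleaved pairing of dimensions are assumptions chosen for illustration, and the actual implementation may pair dimensions differently or rotate only part of each attention head. The point it shows is that after rotation, the query-key dot product depends only on the relative distance between positions.

```python
import math


def rotate_pairs(vec, position, base=10000.0):
    """Rotate each (even, odd) pair of `vec` by an angle that grows with `position`.

    Hypothetical helper for illustration; `base` is an assumed default.
    """
    if len(vec) % 2 != 0:
        raise ValueError("vector length must be even")
    dim = len(vec)
    out = []
    for i in range(0, dim, 2):
        # Each pair gets its own frequency; later pairs rotate more slowly.
        theta = position * base ** (-i / dim)
        cos_t, sin_t = math.cos(theta), math.sin(theta)
        x, y = vec[i], vec[i + 1]
        out.extend([x * cos_t - y * sin_t, x * sin_t + y * cos_t])
    return out


def dot(a, b):
    """Plain dot product of two equal-length sequences."""
    return sum(x * y for x, y in zip(a, b))


if __name__ == "__main__":
    q = [1.0, 2.0, 3.0, 4.0]
    k = [0.5, -1.0, 2.0, 0.0]
    # Both position pairs are 2 apart, so the two scores should match.
    print(dot(rotate_pairs(q, 5), rotate_pairs(k, 3)))
    print(dot(rotate_pairs(q, 12), rotate_pairs(k, 10)))
```

Because each pair is only rotated, vector norms are unchanged, and shifting both positions by the same amount leaves the attention score the same.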

Capabilities

Multimodal
Function Calling
Tool Use
JSON Mode

Specifications

Family: RedPajama
Parameters: 3B
Architecture: Decoder Only
Specialization: General
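
As a rough guide to what the parameter count means for deployment, the sketch below estimates the memory needed just to hold the weights at a few common precisions. It uses the 2.8 billion parameter figure from the About section; the helper name `weight_memory_gib` and the list of precisions are assumptions for illustration, and real usage will be higher once activations, the key-value cache, and framework overhead are included.

```python
def weight_memory_gib(num_params, bytes_per_param):
    """Approximate memory for model weights alone, in GiB (hypothetical helper)."""
    return num_params * bytes_per_param / 2**30


if __name__ == "__main__":
    PARAMS = 2.8e9  # parameter count stated in the About section
    # Bytes per parameter for a few common storage precisions.
    for name, nbytes in [("fp32", 4), ("fp16/bf16", 2), ("int8", 1)]:
        print(f"{name}: {weight_memory_gib(PARAMS, nbytes):.1f} GiB")
```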