SQLCoder 70B Alpha
About
SQLCoder-70B-Alpha is a specialized large language model adept in converting natural language descriptions into SQL queries. Developed by Defog, Inc., it enhances CodeLlama-70B and outperforms generalist models like GPT-4 in text-to-SQL tasks. Based on the LlamaForCausalLM architecture, the model offers a context length of 16384 tokens and has been trained on a curated dataset of 20,000 human-created SQL queries, covering diverse SQL concepts. While potent, it is not suitable for handling malicious requests and should be operated with read-only database access. Early feedback highlights the importance of prompt engineering for achieving the best results.
Capabilities
MultimodalFunction CallingTool UseJSON Mode
Specifications
FamilySQLCoder
Released2024-01-31
Parameters70B
ArchitectureDecoder Only
Specializationgeneral