LLM Reference

NSQL 350M

About

The NSQL 350M model, developed by Numbers Station, is an open-source large language model tailored for crafting SQL queries from natural language inputs. It belongs to a model family that also includes larger iterations like NSQL 2B and NSQL 6B and is based on Salesforce's CodeGen models. This autoregressive model constructs queries token by token and is recognized for its ability to generate SELECT queries effectively when provided with a table schema and clear instructions. NSQL 350M undergoes initial training using a corpus of general SQL queries and is fine-tuned on a vast dataset consisting of text-to-SQL pairs drawn from diverse public sources. However, while it excels at generating accurate SQL code for moderately complex queries, its efficacy relies heavily on the prompt structure and may falter with highly intricate queries or those outside its training scope. Despite showing strong potential among open-source models, it is yet to match the precision of proprietary models like GPT-4 in tackling complex tasks.

Capabilities

MultimodalFunction CallingTool UseJSON Mode

Providers(1)

ProviderInput (per 1M)Output (per 1M)Type
Baseten API
Serverless

Specifications

FamilyNSQL
Parameters350M
ArchitectureDecoder Only
Specializationgeneral