NuExtract Large
NuExtract Large has model metadata, but missing tracked provider pricing keeps it from being a default production pick.
Use it for
- Teams evaluating general LLM work
Do not use it for
- Cost-sensitive launches that need sourced token pricing
- Vision or document-understanding workloads
- Strict JSON or tool-calling flows
- Family
- NuExtract
- Released
- 2023-11-30
- Parameters
- 7B
- Architecture
- Decoder Only
- Specialization
- general
- Training
- finetuned
About
NuExtract Large is an advanced information extraction model that derives from the Phi-3-small model. It specializes in turning unstructured text into JSON format, leveraging a JSON template to define the output information structure. The model is purely extractive, meaning it can only extract text that is present in the input, and it is capable of handling input texts up to 2000 tokens. This makes it ideal for tasks such as automated data entry, text summarization, and enhancing search systems. Despite being fine-tuned from a small-scale model, NuExtract Large outperforms some larger models on specific tasks, showcasing its efficiency and effectiveness. It also has companion models like NuExtract and NuExtract-tiny, and a multilingual version, NuExtract 1.5, developed to overcome limitations of its predecessor.
NuExtract Large is a model in the NuExtract family. No headline benchmark score is tracked for NuExtract Large yet.
Top use-case fit
No primary decision-task fit is mapped for this model yet.
Provider price ladder
No tracked provider token pricing is available for this model yet.
Capabilities
No model capability flags are currently sourced.
Benchmark peer barsfor Coding
No task-mapped benchmark peers are available for this model yet.
Migration checks
No linked migration route is available for this model yet.