LLM Reference

NuExtract

Released
2023-11-30
Last refreshed
2026-05-19
Status
Researched 16d ago

NuExtract has model metadata, but missing tracked provider pricing keeps it from being a default production pick.

Use it for

  • Teams evaluating general LLM work

Do not use it for

  • Cost-sensitive launches that need sourced token pricing
  • Vision or document-understanding workloads
  • Strict JSON or tool-calling flows
Specifications
Family
NuExtract
Released
2023-11-30
Parameters
3.8B
Architecture
Decoder Only
Specialization
general
Training
finetuned
Created by

Innovative AI-driven NLP model creator

Paris, France
Founded 2022
Website
Pricing

No tracked provider token pricing is available yet.

About

NuExtract is a collection of lightweight text-to-JSON large language models crafted by NuMind for extracting structured data from unstructured text. It is offered in various sizes: NuExtract-tiny, NuExtract, and NuExtract-large, and supports zero-shot and fine-tuned applications. A notable characteristic is its purely extractive approach, which ensures accuracy by copying output text directly from input, effectively preventing hallucinations. The models employ JSON templates to define necessary information structures, aiding in customizable extraction processes. Trained on a superior synthetic dataset, these models excel in certain tasks compared to larger LLMs. The latest versions, like NuExtract 1.5, offer multilingual capabilities and can handle lengthy documents efficiently.

NuExtract is a model. No headline benchmark score is tracked for NuExtract yet.

Top use-case fit

No primary decision-task fit is mapped for this model yet.

Provider price ladder

No tracked provider token pricing is available for this model yet.

Capabilities

No model capability flags are currently sourced.

Benchmark peer barsfor Coding

No task-mapped benchmark peers are available for this model yet.

Migration checks

No linked migration route is available for this model yet.

Rankings & picks(5)