LLMReference — Data Quality Dashboard
Pulse summarizes weekly release velocity, freshness, cadence, and current frontier pricing. Changelog is the month-by-month archive; this page covers raw field completeness for maintainers.
Headline Coverage
1,901
Total model rows tracked
Full seed catalog, including embeddings and deprecated rows
1,767
Active public models
1,767 non-deprecated language-model rows shown by default on /models
1,864
Public models incl. deprecated
97 deprecated public rows remain available behind the toggle
1,804
Active tracked rows
1,804 non-deprecated rows in the full seed, including embeddings
0
Models added this month
Released in July 2026
1,237
Open-weights models
Self-hostable or open-license rows
140
API providers covered
Provider catalogs and routes
247
Labs and creators
Research labs, companies, and builders
157
Benchmarks covered
Evaluation sources in the seed data
Database Totals
1,767
Active public models
License-Derived Openness
695
Open source
39% of active public models
495
Open weights
28% of active public models
570
Proprietary
32% of active public models
7
Unknown
0% of active public models
| License coverage | Known | Missing / unknown |
|---|
| Active public models | 1,704 (96%) | 63 (4%) |
| Families | 486 (91%) | 51 (9%) |
| OSI classification | Count | Share |
|---|
| OSI-approved effective license | 695 / 1,767 | 39% |
| Non-OSI effective license | 1,009 / 1,767 | 57% |
| Missing / unknown effective license | 63 / 1,767 | 4% |
| Commercial use | Models | Share |
|---|
| Yes | 757 / 1,767 | 43% |
| Conditional | 897 / 1,767 | 51% |
| No | 50 / 1,767 | 3% |
| Unknown / blank | 63 / 1,767 | 4% |
| Verification backlog | Count | Health |
|---|
| License records marked verify:true | 5 / 41 | 5 (12%) |
| License records with unknown openness | 1 / 41 | 1 (2%) |
| License records with unknown commercial use | 1 / 41 | 1 (2%) |
| Models using verify:true licenses | 25 / 1,767 | 25 (1%) |
| Models missing/unknown effective license | 63 / 1,767 | 63 (4%) |
| Families missing/unknown license | 51 / 537 | 51 (9%) |
Model rows use the effective license resolver: model license first, then family license. Missing licenses and licenses classified as unknown remain visible as backlog instead of being inferred from legacy strings.
Data Freshness
| Entity | Total | Placeholder 2026-01-01 | Stale >90d | Stale >180d |
|---|
| Active public models | 1,767 | 235 (13%) | 235 (13%) | 235 (13%) |
| Families | 537 | 137 (26%) | 137 (26%) | 137 (26%) |
| Providers | 140 | 36 (26%) | 36 (26%) | 36 (26%) |
| Benchmarks | 157 | 0 (0%) | 0 (0%) | 0 (0%) |
Missing or invalid lastResearched dates are counted as stale so research queues do not hide unverified rows.
Benchmark Coverage
Source URL coverage
97%
1,264 / 1,303 scores start with http.
Benchmarks with stale score sets
2
- gaokao: 11 rows, latest 2023-05-20
- gaokao-mm: 7 rows, latest 2024-02-23
| Metric | Count | Coverage / Health |
|---|
| Active public models with >=1 benchmark score | 307 / 1,767 | 17% |
| Scores with verified URL source | 1,264 / 1,303 | 97% |
| Benchmarks with zero scores | 63 / 157 | 60% |
| Benchmark scores older than 1 year | 80 / 1,303 | 94% |
| Benchmark scores older than 2 years | 18 / 1,303 | 99% |
| Benchmarks with all scores older than 2 years | 2 / 94 | 98% |
Model Field Completeness
| Group | Field | Filled | Coverage |
|---|
| Core | parameters | 1,281 / 1,767 | 72% |
| Core | context | 1,332 / 1,767 | 75% |
| Core | knowledge_cutoff | 577 / 1,767 | 33% |
| Core | description | 1,767 / 1,767 | 100% |
| Core | release | 1,766 / 1,767 | 100% |
| Core | familySlug | 1,764 / 1,767 | 100% |
| Capability | vision | 298 / 1,767 | 17% |
| Capability | reasoning | 181 / 1,767 | 10% |
| Capability | function_calling | 271 / 1,767 | 15% |
| Capability | tool_use | 251 / 1,767 | 14% |
| Capability | structured_outputs | 417 / 1,767 | 24% |
| Capability | code_execution | 86 / 1,767 | 5% |
| Capability | prompt_caching | 39 / 1,767 | 2% |
| Capability | batch_api | 28 / 1,767 | 2% |
| Capability | audio | 104 / 1,767 | 6% |
| Capability | fine_tuning | 6 / 1,767 | 0% |
| Capability | multimodal | 395 / 1,767 | 22% |
| Metadata | max_output_tokens | 76 / 1,767 | 4% |
| Metadata | licenseSlug | 434 / 1,767 | 25% |
Capability rows count truthy capability flags; empty, 0, and false mean the capability is not currently recorded as present.
Family and Concept Completeness
| Field | Filled | Coverage |
|---|
| family.description | 444 / 537 | 83% |
| family.researcherSlug | 535 / 537 | 100% |
| concept.lastResearched | 26 / 69 | 38% |
Pricing Coverage
| Metric | Count | Coverage / Health |
|---|
| Provider links with pricing | 1,479 / 1,742 | 85% |
| Excludes non-token-priced rows (per-image, per-second, gpu-hour, etc.). |
| Models with ≥1 pricing record | 735 / 1,767 | 42% |
| Provider links with model ID (providerModelId) | 819 / 1,920 | 43% |
| Non-token links missing all prices | 285 / 1,920 | 85% |
| Provider links with $0 token price | 24 / 1,920 | 99% |
Missing-price rows have no token, image, video, audio, query, or gpu-hour price recorded. $0 token rows may include true free-tier entries and should be reviewed before changing seed data.
Data Quality
| Indicator | Count |
|---|
| Duplicate model slugs | 0 |
| Duplicate model+provider pairs | 0 |
| Models without family | 3 |
| Families missing description | 93 |
| Concepts missing lastResearched | 43 |
| Broken family references (model→family) | 0 |
| Broken researcher references (family→researcher) | 0 |
| Broken provider references (link→provider) | 0 |
| Broken model references (link→model) | 0 |
Provider Link Completeness
| Field | Filled | Coverage |
|---|
| docs | 92 / 140 | 66% |
| pricing | 83 / 140 | 59% |
| portal | 81 / 140 | 58% |
| tier | 118 / 140 | 84% |
| providerType | 123 / 140 | 88% |
Provider Coverage (top 25 by model count)
| Provider | Models | Priced | Pricing % |
|---|
| OpenRouter | 216 | 212 | 98% |
| Fireworks AI | 211 | 207 | 98% |
| Vercel AI Gateway | 150 | 140 | 93% |
| AWS Bedrock | 108 | 96 | 89% |
| GCP Vertex AI | 108 | 83 | 77% |
| Microsoft Foundry | 105 | 78 | 74% |
| Together AI | 100 | 98 | 98% |
| Novita AI | 99 | 99 | 100% |
| Replicate API | 98 | 85 | 87% |
| DeepInfra | 52 | 49 | 94% |
| Alibaba Cloud PAI-EAS | 44 | 23 | 52% |
| Google AI Studio | 38 | 19 | 50% |
| Cloudflare Workers AI | 32 | 17 | 53% |
| OpenAI API | 31 | 31 | 100% |
| IBM watsonx | 31 | 31 | 100% |
| Mistral AI Studio | 18 | 16 | 89% |
| Bitdeer AI | 18 | 18 | 100% |
| Anthropic | 16 | 14 | 88% |
| Baseten API | 13 | 0 | 0% |
| SiliconFlow | 13 | 13 | 100% |
| OctoAI API (Deprecated) | 12 | 12 | 100% |
| Lepton AI API | 12 | 12 | 100% |
| Azure OpenAI | 11 | 5 | 45% |
| Databricks Foundation Model Serving | 11 | 5 | 45% |
| Baidu Qianfan | 10 | 10 | 100% |
Model Releases by Month
| Month | New Models |
|---|
| 2026-06 | 53 |
| 2026-05 | 40 |
| 2026-04 | 62 |
| 2026-03 | 59 |
| 2026-02 | 38 |
| 2026-01 | 31 |
| 2025-12 | 48 |
| 2025-11 | 35 |
| 2025-10 | 31 |
| 2025-09 | 29 |
| 2025-08 | 29 |
| 2025-07 | 17 |
| 2025-06 | 18 |
| 2025-05 | 17 |
| 2025-04 | 44 |
| 2025-03 | 41 |
| 2025-02 | 17 |
| 2025-01 | 75 |
| 2024-12 | 35 |
| 2024-11 | 38 |
| 2024-10 | 43 |
| 2024-09 | 55 |
| 2024-08 | 39 |
| 2024-07 | 51 |
| 2024-06 | 77 |
| 2024-05 | 53 |
| 2024-04 | 57 |
| 2024-03 | 39 |
| 2024-02 | 64 |
| 2024-01 | 81 |
| 2023-12 | 80 |
| 2023-11 | 67 |
| 2023-10 | 45 |
| 2023-09 | 15 |
| 2023-08 | 35 |
| 2023-07 | 37 |
| 2023-06 | 23 |
| 2023-05 | 19 |
| 2023-04 | 28 |
| 2023-03 | 29 |
| 2023-02 | 2 |
| 2023-01 | 12 |
| 2022-12 | 3 |
| 2022-11 | 1 |
| 2022-10 | 12 |
| 2022-09 | 1 |
| 2022-07 | 7 |
| 2022-04 | 2 |
| 2022-03 | 2 |
| 2022-02 | 1 |
| 2022-01 | 1 |
| 2021-08 | 5 |
| 2021-05 | 3 |
| 2020-06 | 5 |
| 2020-01 | 5 |
| 2019-11 | 1 |
| 2019-08 | 4 |
| 2019-02 | 2 |
| 2018-10 | 2 |
| 2018-06 | 1 |
Generated at build time from seed JSON. Last updated: 2026-07-02.