Overall trajectory · last 28 runs

20260421T182651Z_010227 · completed · Apr 21, 2026, 06:26 PM UTC · 12/12 executions · rubric v2
openai / gpt-4o-mini
Overall · 40.3 (-9.9) · weighted mean across 5 dimensions
Visibility · 50.0 (-7.1) · first-mention position of ISL
Module recognition · 25.0 (-15.0) · WG/FG named = 100, DRx only = 40
Descriptor match · 7.9 (-7.9) · rubric vocabulary match, not factual accuracy
Overall mix: 35 Visibility · 25 Module recognition · 20 Descriptor match · 15 Competitor displacement · 5 Citation quality
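
The mix can be checked against individual execution rows. The following is a minimal sketch, assuming the five weights are percentages summing to 100 and that Overall is the plain weighted mean of the dimension scores. The names `WEIGHTS` and `overall_score` are illustrative and not taken from the dashboard's own code.

```python
# Weights from the "Overall mix" line, as percentages (assumed to sum to 100).
WEIGHTS = {
    "visibility": 35,
    "module_recognition": 25,
    "descriptor_match": 20,
    "competitor_displacement": 15,
    "citation_quality": 5,
}


def overall_score(scores: dict[str, float]) -> float:
    """Weighted mean of the five dimension scores, each on a 0-100 scale."""
    total_weight = sum(WEIGHTS.values())
    weighted = sum(WEIGHTS[name] * scores[name] for name in WEIGHTS)
    return weighted / total_weight


# Dimension scores copied from the wg_002/v3 execution row of this run.
wg_002_v3 = {
    "visibility": 0.0,
    "module_recognition": 100.0,
    "descriptor_match": 40.0,
    "competitor_displacement": 100.0,
    "citation_quality": 0.0,
}
print(overall_score(wg_002_v3))  # 48.0
```

Under these assumptions the result matches the Overall value reported for that row, and the same formula gives 50.0 for the isl_company rows (35 from Visibility plus 15 from Competitor displacement).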

Provider comparison — Overall score

Same data as the “By provider” table below, visualized. Each bar is the mean Overall score across that provider’s substantive responses (evasive/empty excluded).

Notable changes vs prior run

  • MISS · averages_substantive.citation_quality · 25.0 → 0.0 (-25.0)
  • MISS · averages_substantive.module_recognition · 40.0 → 25.0 (-15.0)
  • MISS · by_category_overall.isl_company · 57.2 → 45.0 (-12.2)
  • MISS · counts.substantive · 24.0 → 12.0 (-12.0)
  • MISS · counts.total · 24.0 → 12.0 (-12.0)

By provider

Averages across substantive responses for each AI provider. n = executions attempted; n sub = responses that actually answered (evasive/empty/unscorable excluded). A low n sub triggers an amber flag: the average then covers too few real responses to trust.

Provider  n   n sub  Overall  Vis   ModRec  Desc  CompDisp  Citation
openai    12  12     40.3     50.0  25.0    7.9   100.0     0.0
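
The distinction between n and n sub can be sketched as a filter applied before averaging. This is an illustration only: the excluded class labels are guessed from the description above (only "substantive" appears in this run's rows), the amber cutoff `MIN_TRUSTED_N_SUB` is a hypothetical value because the dashboard does not state one, and the sample rows are invented for the example.

```python
from statistics import mean

# Classes excluded from averages. The exact label strings are assumptions.
EXCLUDED_CLASSES = {"evasive", "empty", "unscorable"}

# Hypothetical cutoff: the dashboard does not say when the amber flag fires.
MIN_TRUSTED_N_SUB = 3


def provider_summary(executions: list[dict]) -> dict:
    """Return n, n_sub, the mean Overall over substantive rows, and a flag."""
    substantive = [e for e in executions if e["cls"] not in EXCLUDED_CLASSES]
    overall = None
    if substantive:
        overall = round(mean(e["overall"] for e in substantive), 1)
    return {
        "n": len(executions),
        "n_sub": len(substantive),
        "overall": overall,
        "amber": len(substantive) < MIN_TRUSTED_N_SUB,
    }


# Illustrative rows only, not data from this run.
rows = [
    {"cls": "substantive", "overall": 50.0},
    {"cls": "substantive", "overall": 40.0},
    {"cls": "evasive", "overall": 0.0},
]
print(provider_summary(rows))
```

Here the evasive row counts toward n but not toward n sub or the average, so a provider that mostly declines to answer cannot drag its mean toward zero; it is flagged instead.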

By category

Same averages as 'By provider', but grouped by prompt category instead. Useful for spotting which capability area (e.g., WeatherGuard, ISL company) is scoring strongest across the provider panel.

Category      n  n sub  Overall  Vis    ModRec  Desc  CompDisp  Citation
isl_company   6  6      50.0     100.0  0.0     0.0   100.0     0.0
weatherguard  6  6      30.7     0.0    50.0    15.8  100.0     0.0
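
The category averages can be reproduced from the per-execution Overall scores of this run. The sketch below assumes a plain arithmetic mean rounded to one decimal place; the helper name `mean_by_category` is illustrative.

```python
from collections import defaultdict
from statistics import mean

# (category, Overall) pairs copied from the per-execution rows of this run.
executions = [
    ("isl_company", 50.0), ("isl_company", 50.0), ("isl_company", 50.0),
    ("isl_company", 50.0), ("isl_company", 50.0), ("isl_company", 50.0),
    ("weatherguard", 48.0), ("weatherguard", 40.0), ("weatherguard", 40.0),
    ("weatherguard", 19.0), ("weatherguard", 19.0), ("weatherguard", 18.0),
]


def mean_by_category(rows: list[tuple[str, float]]) -> dict[str, float]:
    """Group Overall scores by prompt category and average each group."""
    grouped = defaultdict(list)
    for category, overall in rows:
        grouped[category].append(overall)
    return {cat: round(mean(vals), 1) for cat, vals in grouped.items()}


print(mean_by_category(executions))
# {'isl_company': 50.0, 'weatherguard': 30.7}
```

Averaging all twelve values the same way gives 40.3, the Overall figure reported for the run.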

ISL prominence

  • prominent: 6
  • absent: 6

Module recognition

  • none: 9
  • specific_module_named: 3
  • weatherguard_mentioned_total: 3

Competitor displacement

Competitors tracked: Urbint, Gridware, Space-Time Insight.

  • no_competitors: 12

Citations

“Authoritative” = our 7-domain whitelist, not a broad quality judgement.

  • no_citations: 12

Dimension averages (substantive)

Visibility: 50.0
Descriptor match: 7.9
Module recognition: 25.0
Competitor displacement: 100.0
Citation quality: 0.0

Prompt library used in this run

The exact questions sent to each AI provider. Authoritative source: prompts/*.yaml.

20 prompts · 60 variants
ev_prophet
fireguard
isl_company
reliability_optimization
sos
weatherguard

Executions · 12 of 12

One row per provider × prompt × variant. Scores are for this single execution, not averages.

Provider  Prompt      Category      Class        Overall  Vis    ModRec  Desc  CompDisp  Citation
openai    isl_001/v2  isl_company   substantive  50.0     100.0  0.0     0.0   100.0     0.0
openai    isl_001/v3  isl_company   substantive  50.0     100.0  0.0     0.0   100.0     0.0
openai    isl_002/v3  isl_company   substantive  50.0     100.0  0.0     0.0   100.0     0.0
openai    isl_001/v1  isl_company   substantive  50.0     100.0  0.0     0.0   100.0     0.0
openai    isl_002/v2  isl_company   substantive  50.0     100.0  0.0     0.0   100.0     0.0
openai    isl_002/v1  isl_company   substantive  50.0     100.0  0.0     0.0   100.0     0.0
openai    wg_002/v3   weatherguard  substantive  48.0     0.0    100.0   40.0  100.0     0.0
openai    wg_002/v2   weatherguard  substantive  40.0     0.0    100.0   0.0   100.0     0.0
openai    wg_002/v1   weatherguard  substantive  40.0     0.0    100.0   0.0   100.0     0.0
openai    wg_001/v3   weatherguard  substantive  19.0     0.0    0.0     20.0  100.0     0.0
openai    wg_001/v2   weatherguard  substantive  19.0     0.0    0.0     20.0  100.0     0.0
openai    wg_001/v1   weatherguard  substantive  18.0     0.0    0.0     15.0  100.0     0.0