Overall trajectory · last 28 runs

20260422T151904Z_66e137completed_with_errorsApr 22, 2026, 03:19 PM UTC36/48 executions · 12 failedrubric v2
perplexity / sonaropenai / gpt-4o-minianthropic / claude-sonnet-4-6gemini / gemini-2.5-flash
Overall
i
48.1+0.6
weighted mean across 5 dimensions
Visibility
i
55.6+0.9
first-mention position of ISL
Module recognition
i
40.0+0.2
WG/FG named = 100, DRx only = 40
Descriptor match
i
15.0-0.1
rubric vocabulary match, not factual accuracy
Overall mix35Visibility·25Module recognition·20Descriptor match·15Competitor displacement·5Citation quality

Provider comparison — Overall score

Same data as the “By provider” table below, visualized. Each bar is the mean Overall score across that provider’s substantive responses (evasive/empty excluded).

Notable changes vs prior run

  • GAINcitations.no_citations29.024.0-5.0
  • MISScounts.substantive41.036.0-5.0
  • MISScounts.total41.036.0-5.0
  • MISScompetitor_displacement.no_competitors39.035.0-4.0
  • GAINmodule_recognition.none25.021.0-4.0

By provider

Averages across substantive responses for each AI provider. n = executions attempted; n sub = responses that actually answered (evasive/empty/unscorable excluded). Low n sub triggers an amber flag — the average is over too few real responses to trust.

Providernn subOverallVisModRecDescCompDispCitation
anthropic
121240.850.025.010.0100.00.0
openai
121240.150.025.06.7100.00.0
perplexity
121263.666.770.028.397.550.0

By category

Same averages as 'By provider', but grouped by prompt category instead. Useful for spotting which capability area (e.g., WeatherGuard, ISL company) is scoring strongest across the provider panel.

Categorynn subOverallVisModRecDescCompDispCitation
isl_company
181859.6100.030.05.6100.020.0
weatherguard
181836.711.150.024.498.313.3

ISL prominence

  • prominent20
  • absent16

Module recognition

  • none21
  • specific_module_named14
  • weatherguard_mentioned_total10
  • drx_umbrella_only1

Competitor displacement

i

Competitors tracked: Urbint, Gridware, Space-Time Insight.

  • no_competitors35
  • competitor_only1

Citations

i

“Authoritative” = our 7-domain whitelist, not a broad quality judgement.

  • no_citations24
  • any_authoritative8
  • citations_without_authoritative4

Dimension averages (substantive)

Visibility55.6
Descriptor match15.0
Module recognition40.0
Competitor displacement99.2
Citation quality16.7

Prompt library used in this run

The exact questions sent to each AI provider. Authoritative source: prompts/*.yaml.

20 prompts · 60 variants
ev_prophet
fireguard
isl_company
reliability_optimization
sos
weatherguard

Executions36 of 36

One row per provider × prompt × variant. Click any row to expand the raw response and parsed signals. Scores are for this single execution, not averages.

ProviderPromptCategoryClassOverallVisModRecDescCompDispCitation
perplexitywg_002/v1weatherguardsubstantive86.0100.0100.040.0100.060.0
perplexitywg_002/v3weatherguardsubstantive86.0100.0100.040.0100.060.0
perplexityisl_002/v1isl_companysubstantive82.0100.0100.020.0100.060.0
perplexityisl_002/v3isl_companysubstantive82.0100.0100.020.0100.060.0
perplexityisl_001/v3isl_companysubstantive82.0100.0100.020.0100.060.0
perplexityisl_002/v2isl_companysubstantive82.0100.0100.020.0100.060.0
perplexityisl_001/v2isl_companysubstantive78.0100.0100.00.0100.060.0
perplexityisl_001/v1isl_companysubstantive67.0100.040.020.0100.060.0
openaiisl_002/v1isl_companysubstantive50.0100.00.00.0100.00.0
anthropicisl_002/v3isl_companysubstantive50.0100.00.00.0100.00.0
openaiisl_002/v2isl_companysubstantive50.0100.00.00.0100.00.0
openaiisl_001/v3isl_companysubstantive50.0100.00.00.0100.00.0
openaiisl_001/v1isl_companysubstantive50.0100.00.00.0100.00.0
anthropicisl_002/v2isl_companysubstantive50.0100.00.00.0100.00.0
openaiisl_001/v2isl_companysubstantive50.0100.00.00.0100.00.0
anthropicisl_001/v3isl_companysubstantive50.0100.00.00.0100.00.0
openaiisl_002/v3isl_companysubstantive50.0100.00.00.0100.00.0
anthropicisl_001/v1isl_companysubstantive50.0100.00.00.0100.00.0
anthropicisl_001/v2isl_companysubstantive50.0100.00.00.0100.00.0
anthropicisl_002/v1isl_companysubstantive50.0100.00.00.0100.00.0
openaiwg_002/v3weatherguardsubstantive48.00.0100.040.0100.00.0
anthropicwg_002/v3weatherguardsubstantive48.00.0100.040.0100.00.0
perplexitywg_002/v2weatherguardsubstantive41.50.0100.00.0100.030.0
openaiwg_002/v2weatherguardsubstantive40.00.0100.00.0100.00.0
openaiwg_002/v1weatherguardsubstantive40.00.0100.00.0100.00.0
anthropicwg_002/v1weatherguardsubstantive40.00.0100.00.0100.00.0
anthropicwg_002/v2weatherguardsubstantive40.00.0100.00.0100.00.0
perplexitywg_001/v1weatherguardsubstantive32.50.00.080.0100.030.0
perplexitywg_001/v2weatherguardsubstantive24.50.00.040.0100.030.0
anthropicwg_001/v1weatherguardsubstantive23.00.00.040.0100.00.0
openaiwg_001/v1weatherguardsubstantive23.00.00.040.0100.00.0
perplexitywg_001/v3weatherguardsubstantive20.00.00.040.070.030.0
anthropicwg_001/v2weatherguardsubstantive19.00.00.020.0100.00.0
anthropicwg_001/v3weatherguardsubstantive19.00.00.020.0100.00.0
openaiwg_001/v3weatherguardsubstantive15.00.00.00.0100.00.0
openaiwg_001/v2weatherguardsubstantive15.00.00.00.0100.00.0