Overall trajectory · last 28 runs

20260420T153553Z_0297f6 · completed · Apr 20, 2026, 03:35 PM UTC · 12/12 executions · rubric v2
perplexity / sonar
Overall: 53.9 (-15.8)
weighted mean across 5 dimensions
Visibility: 58.3 (-41.7)
first-mention position of ISL
Module recognition: 58.3 (+13.3)
WG/FG named = 100, DRx only = 40
Descriptor match: 14.6 (-5.4)
rubric vocabulary match, not factual accuracy
Overall mix (weights, %): 35 Visibility · 25 Module recognition · 20 Descriptor match · 15 Competitor displacement · 5 Citation quality
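
As a rough check on how the Overall score relates to the mix, the weighted mean can be sketched as below. This is a minimal sketch, assuming the five mix numbers are percentage weights and that every dimension is scored on a 0-100 scale; the dictionary keys and the function name `overall_score` are illustrative and not taken from the scoring code itself.

```python
# Weights copied from the "Overall mix" line (assumed to be percentages).
WEIGHTS = {
    "visibility": 35,
    "module_recognition": 25,
    "descriptor_match": 20,
    "competitor_displacement": 15,
    "citation_quality": 5,
}


def overall_score(dims):
    """Weighted mean of the five dimension scores (each assumed 0-100)."""
    total_weight = sum(WEIGHTS.values())
    return sum(WEIGHTS[name] * dims[name] for name in WEIGHTS) / total_weight


# Dimension averages reported for this run.
run_averages = {
    "visibility": 58.3,
    "module_recognition": 58.3,
    "descriptor_match": 14.6,
    "competitor_displacement": 97.5,
    "citation_quality": 27.5,
}

print(round(overall_score(run_averages), 1))
```

With the dimension averages of this run, the sketch lands on the reported Overall of 53.9, which suggests the mix is applied as a plain weighted mean.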

Provider comparison — Overall score

Same data as the “By provider” table below, visualized. Each bar is the mean Overall score across that provider’s substantive responses (evasive/empty excluded).

Notable changes vs prior run

  • MISS · averages_substantive.visibility · 100.0 → 58.3 (-41.7)
  • MISS · averages_substantive.citation_quality · 60.0 → 27.5 (-32.5)
  • MISS · averages_substantive.overall · 65.0 → 49.2 (-15.8)
  • MISS · by_provider_overall.perplexity · 65.0 → 49.2 (-15.8)
  • GAIN · averages_substantive.module_recognition · 40.0 → 53.3 (+13.3)

By provider

Averages across substantive responses for each AI provider. n = executions attempted; n sub = responses that actually answered (evasive/empty/unscorable excluded). Low n sub triggers an amber flag — the average is over too few real responses to trust.

Provider    n   n sub  Overall  Vis   ModRec  Desc  CompDisp  Citation
perplexity  12  12     53.9     58.3  58.3    14.6  97.5      27.5

By category

Same averages as 'By provider', but grouped by prompt category instead. Useful for spotting which capability area (e.g., WeatherGuard, ISL company) is scoring strongest across the provider panel.

Category      n  n sub  Overall  Vis    ModRec  Desc  CompDisp  Citation
isl_company   6  6      66.5     100.0  50.0    10.0  100.0     40.0
weatherguard  6  6      41.3     16.7   66.7    19.2  95.0      15.0
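
The category averages above can be reproduced from the per-execution Overall scores listed in the Executions table of this report. The following is a minimal sketch, assuming a plain arithmetic mean per category; because all 12 executions in this run are classed as substantive, no filtering step is shown. The names `EXECUTIONS` and `mean_by_category` are illustrative.

```python
from collections import defaultdict

# (category, overall) pairs copied from the Executions table of this run.
EXECUTIONS = [
    ("isl_company", 82.0),
    ("isl_company", 82.0),
    ("isl_company", 82.0),
    ("weatherguard", 78.0),
    ("isl_company", 53.0),
    ("isl_company", 50.0),
    ("isl_company", 50.0),
    ("weatherguard", 48.0),
    ("weatherguard", 43.5),
    ("weatherguard", 41.5),
    ("weatherguard", 19.0),
    ("weatherguard", 18.0),
]


def mean_by_category(rows):
    """Group Overall scores by category and return the rounded mean of each."""
    buckets = defaultdict(list)
    for category, overall in rows:
        buckets[category].append(overall)
    return {cat: round(sum(vals) / len(vals), 1) for cat, vals in buckets.items()}


print(mean_by_category(EXECUTIONS))
```

Averaging all 12 rows together in the same way gives the provider-level Overall of 53.9.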

ISL prominence

  • prominent: 7
  • absent: 5

Module recognition

  • specific_module_named: 7
  • weatherguard_mentioned_total: 6
  • none: 5

Competitor displacement

Competitors tracked: Urbint, Gridware, Space-Time Insight.

  • no_competitors: 11
  • competitor_only: 1

Citations

“Authoritative” = our 7-domain whitelist, not a broad quality judgement.

  • no_citations: 6
  • any_authoritative: 5
  • citations_without_authoritative: 1
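
The three citation buckets above could be assigned along the following lines. This is only a sketch under stated assumptions: the real 7-domain whitelist is not shown in this report, so `AUTHORITATIVE_DOMAINS` holds placeholder domains, and the function name `citation_bucket` and the subdomain-matching rule are hypothetical.

```python
from urllib.parse import urlparse

# Placeholder entries only; the actual 7-domain whitelist is not listed here.
AUTHORITATIVE_DOMAINS = {"example.com", "example.org"}


def is_authoritative(url):
    """True if the URL's host is a whitelisted domain or one of its subdomains."""
    host = (urlparse(url).hostname or "").lower()
    return any(host == d or host.endswith("." + d) for d in AUTHORITATIVE_DOMAINS)


def citation_bucket(urls):
    """Map the citations of one response to one of the three report buckets."""
    if not urls:
        return "no_citations"
    if any(is_authoritative(u) for u in urls):
        return "any_authoritative"
    return "citations_without_authoritative"


print(citation_bucket([]))
print(citation_bucket(["https://www.example.com/page"]))
print(citation_bucket(["https://unlisted.test/post"]))
```

Matching on the hostname rather than the full URL keeps the check independent of paths and query strings, which fits the report's framing of "authoritative" as whitelist membership rather than a quality judgement.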

Dimension averages (substantive)

Visibility: 58.3
Descriptor match: 14.6
Module recognition: 58.3
Competitor displacement: 97.5
Citation quality: 27.5

Prompt library used in this run

The exact questions sent to each AI provider. Authoritative source: prompts/*.yaml.

20 prompts · 60 variants
ev_prophet
fireguard
isl_company
reliability_optimization
sos
weatherguard

Executions · 12 of 12

One row per provider × prompt × variant. Scores are for this single execution, not averages.

Provider    Prompt      Category      Class        Overall  Vis    ModRec  Desc  CompDisp  Citation
perplexity  isl_002/v1  isl_company   substantive  82.0     100.0  100.0   20.0  100.0     60.0
perplexity  isl_002/v3  isl_company   substantive  82.0     100.0  100.0   20.0  100.0     60.0
perplexity  isl_002/v2  isl_company   substantive  82.0     100.0  100.0   20.0  100.0     60.0
perplexity  wg_002/v1   weatherguard  substantive  78.0     100.0  100.0   0.0   100.0     60.0
perplexity  isl_001/v2  isl_company   substantive  53.0     100.0  0.0     0.0   100.0     60.0
perplexity  isl_001/v1  isl_company   substantive  50.0     100.0  0.0     0.0   100.0     0.0
perplexity  isl_001/v3  isl_company   substantive  50.0     100.0  0.0     0.0   100.0     0.0
perplexity  wg_002/v3   weatherguard  substantive  48.0     0.0    100.0   40.0  100.0     0.0
perplexity  wg_001/v2   weatherguard  substantive  43.5     0.0    100.0   40.0  70.0      0.0
perplexity  wg_002/v2   weatherguard  substantive  41.5     0.0    100.0   0.0   100.0     30.0
perplexity  wg_001/v3   weatherguard  substantive  19.0     0.0    0.0     20.0  100.0     0.0
perplexity  wg_001/v1   weatherguard  substantive  18.0     0.0    0.0     15.0  100.0     0.0