Historical run

Overall trajectory · last 28 runs

20260420T153553Z_0297f6 • completed • Apr 20, 2026, 03:35 PM UTC • 12/12 executions • rubric v2

perplexity / sonar

Overall

53.9 (-15.8 vs prior run)

weighted mean across 5 dimensions

Visibility

58.3 (-41.7 vs prior run)

first-mention position of ISL

Module recognition

58.3 (+13.3 vs prior run)

WeatherGuard/FireGuard (WG/FG) named = 100, DRx only = 40

Descriptor match

14.6 (-5.4 vs prior run)

rubric vocabulary match, not factual accuracy

Overall mix: 35 Visibility · 25 Module recognition · 20 Descriptor match · 15 Competitor displacement · 5 Citation quality
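The mix above can be read as a weighted mean of the five dimension scores. The following is a minimal sketch of that calculation, assuming the five numbers are percentage weights and every dimension is scored on a 0-100 scale. The names `WEIGHTS` and `overall_score` are illustrative and are not taken from the dashboard's code.

```python
# Weights copied from the "Overall mix" line. Treating them as
# percentage weights that sum to 100 is an assumption.
WEIGHTS = {
    "visibility": 35,
    "module_recognition": 25,
    "descriptor_match": 20,
    "competitor_displacement": 15,
    "citation_quality": 5,
}


def overall_score(scores: dict[str, float]) -> float:
    """Return the weighted mean of the five dimension scores (0-100 each)."""
    total_weight = sum(WEIGHTS.values())
    weighted_sum = sum(WEIGHTS[name] * scores[name] for name in WEIGHTS)
    return round(weighted_sum / total_weight, 1)


# Dimension scores for isl_002/v1, as listed in the executions table.
isl_002_v1 = {
    "visibility": 100.0,
    "module_recognition": 100.0,
    "descriptor_match": 20.0,
    "competitor_displacement": 100.0,
    "citation_quality": 60.0,
}

print(overall_score(isl_002_v1))  # 82.0
```

With these weights the sketch reproduces the per-execution Overall values reported in this run, for example 82.0 for isl_002/v1 and 43.5 for wg_001/v2.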

Provider comparison — Overall score

Same data as the “By provider” table below, visualized. Each bar is the mean Overall score across that provider’s substantive responses (evasive/empty excluded).

Notable changes vs prior run

MISS averages_substantive.visibility 100.0 → 58.3 (-41.7)
MISS averages_substantive.citation_quality 60.0 → 27.5 (-32.5)
MISS averages_substantive.overall 65.0 → 49.2 (-15.8)
MISS by_provider_overall.perplexity 65.0 → 49.2 (-15.8)
GAIN averages_substantive.module_recognition 40.0 → 53.3 (+13.3)
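Each entry in this list pairs a metric with its prior value, current value, and signed difference. A minimal sketch of how such a line could be derived is shown here. It assumes MISS marks a decrease and GAIN an increase; the `FLAT` label and the function name `describe_change` are hypothetical, and whatever threshold makes a change "notable" is not modeled.

```python
def describe_change(metric: str, prior: float, current: float) -> str:
    """Format one change line: label, metric, prior -> current, signed delta."""
    delta = round(current - prior, 1)
    if delta < 0:
        label = "MISS"
    elif delta > 0:
        label = "GAIN"
    else:
        label = "FLAT"  # hypothetical label; the report shows only MISS and GAIN
    return f"{label} {metric} {prior:.1f} -> {current:.1f} ({delta:+.1f})"


print(describe_change("averages_substantive.visibility", 100.0, 58.3))
print(describe_change("averages_substantive.module_recognition", 40.0, 53.3))
```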

By provider

Averages across substantive responses for each AI provider. n = executions attempted; n sub = responses that actually answered (evasive/empty/unscorable excluded). Low n sub triggers an amber flag — the average is over too few real responses to trust.

Provider	n	n sub	Overall	Vis	ModRec	Desc	CompDisp	Citation
perplexity	12	12	53.9	58.3	58.3	14.6	97.5	27.5

By category

Same averages as 'By provider', but grouped by prompt category instead. Useful for spotting which capability area (e.g., WeatherGuard, ISL company) is scoring strongest across the provider panel.

Category	n	n sub	Overall	Vis	ModRec	Desc	CompDisp	Citation
isl_company	6	6	66.5	100.0	50.0	10.0	100.0	40.0
weatherguard	6	6	41.3	16.7	66.7	19.2	95.0	15.0
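The category averages can be reproduced from the per-execution rows of this run. The sketch below assumes each average is a plain arithmetic mean over the rows classed as substantive, rounded to one decimal place. The tuple layout and the names `EXECUTIONS`, `COLUMNS`, and `averages_by_category` are illustrative only.

```python
from collections import defaultdict
from statistics import mean

COLUMNS = ("overall", "vis", "modrec", "desc", "compdisp", "citation")

# (prompt, category, class, overall, vis, modrec, desc, compdisp, citation)
# Values copied from the executions table of this run.
EXECUTIONS = [
    ("isl_002/v1", "isl_company", "substantive", 82.0, 100.0, 100.0, 20.0, 100.0, 60.0),
    ("isl_002/v3", "isl_company", "substantive", 82.0, 100.0, 100.0, 20.0, 100.0, 60.0),
    ("isl_002/v2", "isl_company", "substantive", 82.0, 100.0, 100.0, 20.0, 100.0, 60.0),
    ("wg_002/v1", "weatherguard", "substantive", 78.0, 100.0, 100.0, 0.0, 100.0, 60.0),
    ("isl_001/v2", "isl_company", "substantive", 53.0, 100.0, 0.0, 0.0, 100.0, 60.0),
    ("isl_001/v1", "isl_company", "substantive", 50.0, 100.0, 0.0, 0.0, 100.0, 0.0),
    ("isl_001/v3", "isl_company", "substantive", 50.0, 100.0, 0.0, 0.0, 100.0, 0.0),
    ("wg_002/v3", "weatherguard", "substantive", 48.0, 0.0, 100.0, 40.0, 100.0, 0.0),
    ("wg_001/v2", "weatherguard", "substantive", 43.5, 0.0, 100.0, 40.0, 70.0, 0.0),
    ("wg_002/v2", "weatherguard", "substantive", 41.5, 0.0, 100.0, 0.0, 100.0, 30.0),
    ("wg_001/v3", "weatherguard", "substantive", 19.0, 0.0, 0.0, 20.0, 100.0, 0.0),
    ("wg_001/v1", "weatherguard", "substantive", 18.0, 0.0, 0.0, 15.0, 100.0, 0.0),
]


def averages_by_category(executions):
    """Average each score column per category over substantive rows only."""
    groups = defaultdict(list)
    for _prompt, category, klass, *scores in executions:
        if klass == "substantive":
            groups[category].append(scores)

    result = {}
    for category, rows in groups.items():
        # zip(*rows) transposes the rows into one tuple per score column.
        column_means = {
            col: round(mean(values), 1)
            for col, values in zip(COLUMNS, zip(*rows))
        }
        result[category] = {"n_sub": len(rows), **column_means}
    return result


for category, stats in averages_by_category(EXECUTIONS).items():
    print(category, stats)
```

Grouping on the provider field instead of the category field would give the "By provider" view in the same way.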

ISL prominence

prominent: 7
absent: 5

Module recognition

specific_module_named: 7
weatherguard_mentioned_total: 6
none: 5

Competitor displacement

Competitors tracked: Urbint, Gridware, Space-Time Insight.

no_competitors: 11
competitor_only: 1

Citations

“Authoritative” = our 7-domain whitelist, not a broad quality judgement.

no_citations: 6
any_authoritative: 5
citations_without_authoritative: 1
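The three buckets above suggest a simple whitelist check on the domains a response cites. The sketch below shows one way that check might work. The real 7-domain whitelist is not listed in this report, so `AUTHORITATIVE_DOMAINS` holds placeholder domains, and the matching rule (exact host or subdomain, ignoring a leading `www.`) is an assumption.

```python
from urllib.parse import urlparse

# Placeholder whitelist: the actual 7 domains are not given in this report.
AUTHORITATIVE_DOMAINS = {"example.com", "example.org"}


def is_authoritative(url: str) -> bool:
    """True if the URL's host is a whitelisted domain or one of its subdomains."""
    host = (urlparse(url).hostname or "").lower()
    if host.startswith("www."):
        host = host[4:]
    return any(
        host == domain or host.endswith("." + domain)
        for domain in AUTHORITATIVE_DOMAINS
    )


def classify_citations(urls: list[str]) -> str:
    """Map a response's cited URLs to one of the three report buckets."""
    if not urls:
        return "no_citations"
    if any(is_authoritative(url) for url in urls):
        return "any_authoritative"
    return "citations_without_authoritative"


print(classify_citations([]))
print(classify_citations(["https://www.example.com/report"]))
print(classify_citations(["https://blog.example.net/post"]))
```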

Dimension averages (substantive)

Visibility: 58.3
Descriptor match: 14.6
Module recognition: 58.3
Competitor displacement: 97.5
Citation quality: 27.5

Prompt library used in this run

The exact questions sent to each AI provider. Authoritative source: prompts/*.yaml.

20 prompts · 60 variants

ev_prophet

fireguard

isl_company

reliability_optimization

sos

weatherguard

Executions: 12 of 12

One row per provider × prompt × variant. Scores are for this single execution, not averages.

Provider	Prompt	Category	Class	Overall	Vis	ModRec	Desc	CompDisp	Citation
perplexity	isl_002/v1	isl_company	substantive	82.0	100.0	100.0	20.0	100.0	60.0
perplexity	isl_002/v3	isl_company	substantive	82.0	100.0	100.0	20.0	100.0	60.0
perplexity	isl_002/v2	isl_company	substantive	82.0	100.0	100.0	20.0	100.0	60.0
perplexity	wg_002/v1	weatherguard	substantive	78.0	100.0	100.0	0.0	100.0	60.0
perplexity	isl_001/v2	isl_company	substantive	53.0	100.0	0.0	0.0	100.0	60.0
perplexity	isl_001/v1	isl_company	substantive	50.0	100.0	0.0	0.0	100.0	0.0
perplexity	isl_001/v3	isl_company	substantive	50.0	100.0	0.0	0.0	100.0	0.0
perplexity	wg_002/v3	weatherguard	substantive	48.0	0.0	100.0	40.0	100.0	0.0
perplexity	wg_001/v2	weatherguard	substantive	43.5	0.0	100.0	40.0	70.0	0.0
perplexity	wg_002/v2	weatherguard	substantive	41.5	0.0	100.0	0.0	100.0	30.0
perplexity	wg_001/v3	weatherguard	substantive	19.0	0.0	0.0	20.0	100.0	0.0
perplexity	wg_001/v1	weatherguard	substantive	18.0	0.0	0.0	15.0	100.0	0.0