Historical run

Overall trajectory · last 28 runs

20260422T151904Z_66e137 • completed_with_errors • Apr 22, 2026, 03:19 PM UTC • 36/48 executions · 12 failed • rubric v2

perplexity / sonar • openai / gpt-4o-mini • anthropic / claude-sonnet-4-6 • gemini / gemini-2.5-flash

Overall

48.1 (+0.6)

weighted mean across 5 dimensions

Visibility

55.6 (+0.9)

first-mention position of ISL

Module recognition

40.0 (+0.2)

WG/FG named = 100, DRx only = 40

Descriptor match

15.0 (-0.1)

rubric vocabulary match, not factual accuracy

Overall mix: 35 Visibility · 25 Module recognition · 20 Descriptor match · 15 Competitor displacement · 5 Citation quality
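The Overall score can be reproduced from the mix above. The following is a minimal sketch, assuming the five weights are percentages that sum to 100 and that each dimension is scored on a 0-100 scale; the names `WEIGHTS` and `overall_score` are illustrative and not taken from the dashboard's code.

```python
# Weights copied from the "Overall mix" line; they sum to 100.
WEIGHTS = {
    "visibility": 35,
    "module_recognition": 25,
    "descriptor_match": 20,
    "competitor_displacement": 15,
    "citation_quality": 5,
}


def overall_score(dims: dict) -> float:
    """Weighted mean of the five dimension scores (each assumed 0-100)."""
    total_weight = sum(WEIGHTS.values())
    return sum(WEIGHTS[name] * dims[name] for name in WEIGHTS) / total_weight


# Dimension scores from the perplexity wg_002/v1 row of the Executions table.
example = {
    "visibility": 100.0,
    "module_recognition": 100.0,
    "descriptor_match": 40.0,
    "competitor_displacement": 100.0,
    "citation_quality": 60.0,
}
print(overall_score(example))  # 86.0, matching that row's Overall value
```

The same function gives 50.0 for the openai and anthropic isl_company rows (Visibility 100, Competitor displacement 100, everything else 0), which is consistent with the per-execution values in the report.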

Provider comparison — Overall score

Same data as the “By provider” table below, visualized. Each bar is the mean Overall score across that provider’s substantive responses (evasive/empty excluded).

Notable changes vs prior run

GAIN	citations.no_citations	29.0 → 24.0	-5.0
MISS	counts.substantive	41.0 → 36.0	-5.0
MISS	counts.total	41.0 → 36.0	-5.0
MISS	competitor_displacement.no_competitors	39.0 → 35.0	-4.0
GAIN	module_recognition.none	25.0 → 21.0	-4.0

By provider

Averages across substantive responses for each AI provider. n = executions attempted; n sub = responses that actually answered (evasive/empty/unscorable excluded). Low n sub triggers an amber flag — the average is over too few real responses to trust.

Provider	n	n sub	Overall	Vis	ModRec	Desc	CompDisp	Citation
anthropic	12	12	40.8	50.0	25.0	10.0	100.0	0.0
openai	12	12	40.1	50.0	25.0	6.7	100.0	0.0
perplexity	12	12	63.6	66.7	70.0	28.3	97.5	50.0
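The per-provider aggregation described above can be sketched as follows. This is an illustration only: the row layout, the `by_provider` helper, and the `MIN_N_SUB` threshold for the amber flag are assumptions, since the report does not state the actual cutoff. The sample includes two real perplexity rows from the Executions table plus one made-up evasive row to show the exclusion.

```python
from collections import defaultdict
from statistics import mean

MIN_N_SUB = 5  # hypothetical amber-flag threshold; the real value is not given


def by_provider(executions):
    """Average Overall per provider over substantive responses only."""
    groups = defaultdict(list)
    for row in executions:
        groups[row["provider"]].append(row)

    summary = {}
    for provider, rows in groups.items():
        substantive = [r["overall"] for r in rows if r["class"] == "substantive"]
        summary[provider] = {
            "n": len(rows),                # executions attempted
            "n_sub": len(substantive),     # responses that actually answered
            "overall": round(mean(substantive), 1) if substantive else None,
            "low_n_sub": len(substantive) < MIN_N_SUB,
        }
    return summary


sample = [
    {"provider": "perplexity", "class": "substantive", "overall": 86.0},
    {"provider": "perplexity", "class": "substantive", "overall": 67.0},
    {"provider": "perplexity", "class": "evasive", "overall": 0.0},  # made-up row
]
result = by_provider(sample)
print(result["perplexity"])
```

In this sample the evasive row counts toward n but not toward n sub or the average, so the provider is flagged because only two responses back the mean.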

By category

Same averages as 'By provider', but grouped by prompt category instead. Useful for spotting which capability area (e.g., WeatherGuard, ISL company) is scoring strongest across the provider panel.

Category	n	n sub	Overall	Vis	ModRec	Desc	CompDisp	Citation
isl_company	18	18	59.6	100.0	30.0	5.6	100.0	20.0
weatherguard	18	18	36.7	11.1	50.0	24.4	98.3	13.3

ISL prominence

prominent	20
absent	16

Module recognition

none	21
specific_module_named	14
weatherguard_mentioned_total	10
drx_umbrella_only	1
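These counts tie back to the Module recognition average of 40.0. A minimal sketch, assuming `none`, `specific_module_named`, and `drx_umbrella_only` are mutually exclusive buckets (they sum to 36), that `none` scores 0 as the per-execution rows suggest, and that `weatherguard_mentioned_total` is an overlapping counter that is left out; the helper name is illustrative.

```python
# Bucket scores from the rubric note "WG/FG named = 100, DRx only = 40";
# the 0 for "none" is an assumption consistent with the per-execution rows.
BUCKET_SCORES = {
    "specific_module_named": 100,
    "drx_umbrella_only": 40,
    "none": 0,
}

# Counts from this run (weatherguard_mentioned_total deliberately excluded).
counts = {"none": 21, "specific_module_named": 14, "drx_umbrella_only": 1}


def dimension_average(counts, scores):
    """Mean dimension score implied by bucket counts and per-bucket scores."""
    total = sum(counts.values())
    return sum(scores[bucket] * n for bucket, n in counts.items()) / total


print(round(dimension_average(counts, BUCKET_SCORES), 1))  # 40.0
```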

Competitor displacement

Competitors tracked: Urbint, Gridware, Space-Time Insight.

no_competitors	35
competitor_only	1

Citations

“Authoritative” = our 7-domain whitelist, not a broad quality judgement.

no_citations	24
any_authoritative	8
citations_without_authoritative	4

Dimension averages (substantive)

Visibility	55.6
Descriptor match	15.0
Module recognition	40.0
Competitor displacement	99.2
Citation quality	16.7

Prompt library used in this run

The exact questions sent to each AI provider. Authoritative source: prompts/*.yaml.

20 prompts · 60 variants

ev_prophet

fireguard

isl_company

reliability_optimization

sos

weatherguard

Executions: 36 of 36

One row per provider × prompt × variant. Scores are for this single execution, not averages.

Provider	Prompt	Category	Class	Overall	Vis	ModRec	Desc	CompDisp	Citation
perplexity	wg_002/v1	weatherguard	substantive	86.0	100.0	100.0	40.0	100.0	60.0
perplexity	wg_002/v3	weatherguard	substantive	86.0	100.0	100.0	40.0	100.0	60.0
perplexity	isl_002/v1	isl_company	substantive	82.0	100.0	100.0	20.0	100.0	60.0
perplexity	isl_002/v3	isl_company	substantive	82.0	100.0	100.0	20.0	100.0	60.0
perplexity	isl_001/v3	isl_company	substantive	82.0	100.0	100.0	20.0	100.0	60.0
perplexity	isl_002/v2	isl_company	substantive	82.0	100.0	100.0	20.0	100.0	60.0
perplexity	isl_001/v2	isl_company	substantive	78.0	100.0	100.0	0.0	100.0	60.0
perplexity	isl_001/v1	isl_company	substantive	67.0	100.0	40.0	20.0	100.0	60.0
openai	isl_002/v1	isl_company	substantive	50.0	100.0	0.0	0.0	100.0	0.0
anthropic	isl_002/v3	isl_company	substantive	50.0	100.0	0.0	0.0	100.0	0.0
openai	isl_002/v2	isl_company	substantive	50.0	100.0	0.0	0.0	100.0	0.0
openai	isl_001/v3	isl_company	substantive	50.0	100.0	0.0	0.0	100.0	0.0
openai	isl_001/v1	isl_company	substantive	50.0	100.0	0.0	0.0	100.0	0.0
anthropic	isl_002/v2	isl_company	substantive	50.0	100.0	0.0	0.0	100.0	0.0
openai	isl_001/v2	isl_company	substantive	50.0	100.0	0.0	0.0	100.0	0.0
anthropic	isl_001/v3	isl_company	substantive	50.0	100.0	0.0	0.0	100.0	0.0
openai	isl_002/v3	isl_company	substantive	50.0	100.0	0.0	0.0	100.0	0.0
anthropic	isl_001/v1	isl_company	substantive	50.0	100.0	0.0	0.0	100.0	0.0
anthropic	isl_001/v2	isl_company	substantive	50.0	100.0	0.0	0.0	100.0	0.0
anthropic	isl_002/v1	isl_company	substantive	50.0	100.0	0.0	0.0	100.0	0.0
openai	wg_002/v3	weatherguard	substantive	48.0	0.0	100.0	40.0	100.0	0.0
anthropic	wg_002/v3	weatherguard	substantive	48.0	0.0	100.0	40.0	100.0	0.0
perplexity	wg_002/v2	weatherguard	substantive	41.5	0.0	100.0	0.0	100.0	30.0
openai	wg_002/v2	weatherguard	substantive	40.0	0.0	100.0	0.0	100.0	0.0
openai	wg_002/v1	weatherguard	substantive	40.0	0.0	100.0	0.0	100.0	0.0
anthropic	wg_002/v1	weatherguard	substantive	40.0	0.0	100.0	0.0	100.0	0.0
anthropic	wg_002/v2	weatherguard	substantive	40.0	0.0	100.0	0.0	100.0	0.0
perplexity	wg_001/v1	weatherguard	substantive	32.5	0.0	0.0	80.0	100.0	30.0
perplexity	wg_001/v2	weatherguard	substantive	24.5	0.0	0.0	40.0	100.0	30.0
anthropic	wg_001/v1	weatherguard	substantive	23.0	0.0	0.0	40.0	100.0	0.0
openai	wg_001/v1	weatherguard	substantive	23.0	0.0	0.0	40.0	100.0	0.0
perplexity	wg_001/v3	weatherguard	substantive	20.0	0.0	0.0	40.0	70.0	30.0
anthropic	wg_001/v2	weatherguard	substantive	19.0	0.0	0.0	20.0	100.0	0.0
anthropic	wg_001/v3	weatherguard	substantive	19.0	0.0	0.0	20.0	100.0	0.0
openai	wg_001/v3	weatherguard	substantive	15.0	0.0	0.0	0.0	100.0	0.0
openai	wg_001/v2	weatherguard	substantive	15.0	0.0	0.0	0.0	100.0	0.0