Historical run

View current dashboard →

Overall trajectory · last 28 runs

20260421T190515Z_8222becompleted_with_errors•Apr 21, 2026, 07:05 PM UTC•41/48 executions · 7 failed•rubric v2

perplexity / sonaropenai / gpt-4o-minianthropic / claude-sonnet-4-6gemini / gemini-2.5-flash

Overall

47.0+4.5

weighted mean across 5 dimensions

Visibility

54.6-0.9

first-mention position of ISL

Module recognition

37.6+10.9

WG/FG named = 100, DRx only = 40

Descriptor match

15.1+4.0

rubric vocabulary match, not factual accuracy

Overall mix35Visibility·25Module recognition·20Descriptor match·15Competitor displacement·5Citation quality

Provider comparison — Overall score

Same data as the “By provider” table below, visualized. Each bar is the mean Overall score across that provider’s substantive responses (evasive/empty excluded).

Notable changes vs prior run

GAINcounts.substantive9.0→41.0+32.0
GAINcounts.total9.0→41.0+32.0
GAINcompetitor_displacement.no_competitors9.0→39.0+30.0
MISScitations.no_citations9.0→29.0+20.0
MISSmodule_recognition.none7.0→25.0+18.0

By provider

Averages across substantive responses for each AI provider. n = executions attempted; n sub = responses that actually answered (evasive/empty/unscorable excluded). Low n sub triggers an amber flag — the average is over too few real responses to trust.

Provider	n	n sub	Overall	Vis	ModRec	Desc	CompDisp	Citation
anthropic	12	12	40.7	50.0	25.0	11.7	97.5	0.0
gemini	5	5	44.2	60.0	20.0	16.0	100.0	0.0
openai	12	12	40.4	50.0	25.0	8.3	100.0	0.0
perplexity	12	12	61.2	61.7	70.0	25.0	97.5	50.0

By category

Same averages as 'By provider', but grouped by prompt category instead. Useful for spotting which capability area (e.g., WeatherGuard, ISL company) is scoring strongest across the provider panel.

Category	n	n sub	Overall	Vis	ModRec	Desc	CompDisp	Citation
isl_company	21	21	57.7	98.6	25.7	4.8	100.0	17.1
weatherguard	20	20	35.8	8.5	50.0	26.0	97.0	12.0

ISL prominence

prominent21
mentioned1
unclear1
absent18

Module recognition

none25
specific_module_named15
weatherguard_mentioned_total12
drx_umbrella_only1

Competitor displacement

Competitors tracked: Urbint, Gridware, Space-Time Insight.

no_competitors39
competitor_only2

Citations

“Authoritative” = our 7-domain whitelist, not a broad quality judgement.

no_citations29
any_authoritative8
citations_without_authoritative4

Dimension averages (substantive)

Visibility54.6

Descriptor match15.1

Module recognition37.6

Competitor displacement98.5

Citation quality14.6

Prompt library used in this run

The exact questions sent to each AI provider. Authoritative source: prompts/*.yaml.

20 prompts · 60 variants

ev_prophet

fireguard

isl_company

reliability_optimization

sos

weatherguard

Executions41 of 41

One row per provider × prompt × variant. Click any row to expand the raw response and parsed signals. Scores are for this single execution, not averages.

ProviderCategory

Provider	Prompt	Category	Class	Overall	Vis	ModRec	Desc	CompDisp	Citation
perplexity	isl_002/v2	isl_company	substantive	82.0	100.0	100.0	20.0	100.0	60.0
perplexity	isl_002/v1	isl_company	substantive	82.0	100.0	100.0	20.0	100.0	60.0
perplexity	isl_001/v3	isl_company	substantive	82.0	100.0	100.0	20.0	100.0	60.0
perplexity	wg_002/v1	weatherguard	substantive	82.0	100.0	100.0	20.0	100.0	60.0
perplexity	isl_002/v3	isl_company	substantive	82.0	100.0	100.0	20.0	100.0	60.0
perplexity	wg_002/v3	weatherguard	substantive	79.5	70.0	100.0	60.0	100.0	60.0
perplexity	isl_001/v2	isl_company	substantive	78.0	100.0	100.0	0.0	100.0	60.0
perplexity	isl_001/v1	isl_company	substantive	56.5	70.0	40.0	20.0	100.0	60.0
gemini	wg_002/v2	weatherguard	substantive	52.0	0.0	100.0	60.0	100.0	0.0
openai	isl_002/v1	isl_company	substantive	50.0	100.0	0.0	0.0	100.0	0.0
gemini	isl_002/v3	isl_company	substantive	50.0	100.0	0.0	0.0	100.0	0.0
openai	isl_001/v3	isl_company	substantive	50.0	100.0	0.0	0.0	100.0	0.0
anthropic	isl_002/v2	isl_company	substantive	50.0	100.0	0.0	0.0	100.0	0.0
openai	isl_001/v2	isl_company	substantive	50.0	100.0	0.0	0.0	100.0	0.0
openai	isl_001/v1	isl_company	substantive	50.0	100.0	0.0	0.0	100.0	0.0
anthropic	isl_002/v3	isl_company	substantive	50.0	100.0	0.0	0.0	100.0	0.0
anthropic	isl_002/v1	isl_company	substantive	50.0	100.0	0.0	0.0	100.0	0.0
openai	isl_002/v3	isl_company	substantive	50.0	100.0	0.0	0.0	100.0	0.0
anthropic	isl_001/v3	isl_company	substantive	50.0	100.0	0.0	0.0	100.0	0.0
anthropic	isl_001/v2	isl_company	substantive	50.0	100.0	0.0	0.0	100.0	0.0
anthropic	isl_001/v1	isl_company	substantive	50.0	100.0	0.0	0.0	100.0	0.0
gemini	isl_001/v3	isl_company	substantive	50.0	100.0	0.0	0.0	100.0	0.0
gemini	isl_002/v2	isl_company	substantive	50.0	100.0	0.0	0.0	100.0	0.0
openai	isl_002/v2	isl_company	substantive	50.0	100.0	0.0	0.0	100.0	0.0
openai	wg_002/v3	weatherguard	substantive	48.0	0.0	100.0	40.0	100.0	0.0
anthropic	wg_002/v3	weatherguard	substantive	48.0	0.0	100.0	40.0	100.0	0.0
anthropic	wg_002/v2	weatherguard	substantive	44.0	0.0	100.0	20.0	100.0	0.0
perplexity	wg_002/v2	weatherguard	substantive	41.5	0.0	100.0	0.0	100.0	30.0
openai	wg_002/v1	weatherguard	substantive	40.0	0.0	100.0	0.0	100.0	0.0
openai	wg_002/v2	weatherguard	substantive	40.0	0.0	100.0	0.0	100.0	0.0
anthropic	wg_002/v1	weatherguard	substantive	40.0	0.0	100.0	0.0	100.0	0.0
perplexity	wg_001/v1	weatherguard	substantive	32.5	0.0	0.0	80.0	100.0	30.0
openai	wg_001/v1	weatherguard	substantive	23.0	0.0	0.0	40.0	100.0	0.0
perplexity	wg_001/v2	weatherguard	substantive	20.5	0.0	0.0	20.0	100.0	30.0
anthropic	wg_001/v2	weatherguard	substantive	19.0	0.0	0.0	20.0	100.0	0.0
openai	wg_001/v3	weatherguard	substantive	19.0	0.0	0.0	20.0	100.0	0.0
gemini	wg_001/v3	weatherguard	substantive	19.0	0.0	0.0	20.0	100.0	0.0
anthropic	wg_001/v3	weatherguard	substantive	19.0	0.0	0.0	20.0	100.0	0.0
anthropic	wg_001/v1	weatherguard	substantive	18.5	0.0	0.0	40.0	70.0	0.0
perplexity	wg_001/v3	weatherguard	substantive	16.0	0.0	0.0	20.0	70.0	30.0
openai	wg_001/v2	weatherguard	substantive	15.0	0.0	0.0	0.0	100.0	0.0