Historical run
Overall trajectory · last 28 runs
Provider comparison — Overall score
Same data as the “By provider” table below, visualized. Each bar is the mean Overall score across that provider’s substantive responses (evasive/empty excluded).
Notable changes vs prior run
- GAINcounts.substantive9.0→41.0+32.0
- GAINcounts.total9.0→41.0+32.0
- GAINcompetitor_displacement.no_competitors9.0→39.0+30.0
- MISScitations.no_citations9.0→29.0+20.0
- MISSmodule_recognition.none7.0→25.0+18.0
By provider
Averages across substantive responses for each AI provider. n = executions attempted; n sub = responses that actually answered (evasive/empty/unscorable excluded). Low n sub triggers an amber flag — the average is over too few real responses to trust.
| Provider | n | n sub | Overall | Vis | ModRec | Desc | CompDisp | Citation |
|---|---|---|---|---|---|---|---|---|
anthropic | 12 | 12 | 40.7 | 50.0 | 25.0 | 11.7 | 97.5 | 0.0 |
gemini | 5 | 5 | 44.2 | 60.0 | 20.0 | 16.0 | 100.0 | 0.0 |
openai | 12 | 12 | 40.4 | 50.0 | 25.0 | 8.3 | 100.0 | 0.0 |
perplexity | 12 | 12 | 61.2 | 61.7 | 70.0 | 25.0 | 97.5 | 50.0 |
By category
Same averages as 'By provider', but grouped by prompt category instead. Useful for spotting which capability area (e.g., WeatherGuard, ISL company) is scoring strongest across the provider panel.
| Category | n | n sub | Overall | Vis | ModRec | Desc | CompDisp | Citation |
|---|---|---|---|---|---|---|---|---|
isl_company | 21 | 21 | 57.7 | 98.6 | 25.7 | 4.8 | 100.0 | 17.1 |
weatherguard | 20 | 20 | 35.8 | 8.5 | 50.0 | 26.0 | 97.0 | 12.0 |
ISL prominence
- prominent21
- mentioned1
- unclear1
- absent18
Module recognition
- none25
- specific_module_named15
- weatherguard_mentioned_total12
- drx_umbrella_only1
Competitor displacement
iCompetitors tracked: Urbint, Gridware, Space-Time Insight.
- no_competitors39
- competitor_only2
Citations
i“Authoritative” = our 7-domain whitelist, not a broad quality judgement.
- no_citations29
- any_authoritative8
- citations_without_authoritative4
Dimension averages (substantive)
Prompt library used in this run
The exact questions sent to each AI provider. Authoritative source: prompts/*.yaml.
Executions41 of 41
One row per provider × prompt × variant. Click any row to expand the raw response and parsed signals. Scores are for this single execution, not averages.
| Provider | Prompt | Category | Class | Overall | Vis | ModRec | Desc | CompDisp | Citation |
|---|---|---|---|---|---|---|---|---|---|
| perplexity | isl_002/v2 | isl_company | substantive | 82.0 | 100.0 | 100.0 | 20.0 | 100.0 | 60.0 |
| perplexity | isl_002/v1 | isl_company | substantive | 82.0 | 100.0 | 100.0 | 20.0 | 100.0 | 60.0 |
| perplexity | isl_001/v3 | isl_company | substantive | 82.0 | 100.0 | 100.0 | 20.0 | 100.0 | 60.0 |
| perplexity | wg_002/v1 | weatherguard | substantive | 82.0 | 100.0 | 100.0 | 20.0 | 100.0 | 60.0 |
| perplexity | isl_002/v3 | isl_company | substantive | 82.0 | 100.0 | 100.0 | 20.0 | 100.0 | 60.0 |
| perplexity | wg_002/v3 | weatherguard | substantive | 79.5 | 70.0 | 100.0 | 60.0 | 100.0 | 60.0 |
| perplexity | isl_001/v2 | isl_company | substantive | 78.0 | 100.0 | 100.0 | 0.0 | 100.0 | 60.0 |
| perplexity | isl_001/v1 | isl_company | substantive | 56.5 | 70.0 | 40.0 | 20.0 | 100.0 | 60.0 |
| gemini | wg_002/v2 | weatherguard | substantive | 52.0 | 0.0 | 100.0 | 60.0 | 100.0 | 0.0 |
| openai | isl_002/v1 | isl_company | substantive | 50.0 | 100.0 | 0.0 | 0.0 | 100.0 | 0.0 |
| gemini | isl_002/v3 | isl_company | substantive | 50.0 | 100.0 | 0.0 | 0.0 | 100.0 | 0.0 |
| openai | isl_001/v3 | isl_company | substantive | 50.0 | 100.0 | 0.0 | 0.0 | 100.0 | 0.0 |
| anthropic | isl_002/v2 | isl_company | substantive | 50.0 | 100.0 | 0.0 | 0.0 | 100.0 | 0.0 |
| openai | isl_001/v2 | isl_company | substantive | 50.0 | 100.0 | 0.0 | 0.0 | 100.0 | 0.0 |
| openai | isl_001/v1 | isl_company | substantive | 50.0 | 100.0 | 0.0 | 0.0 | 100.0 | 0.0 |
| anthropic | isl_002/v3 | isl_company | substantive | 50.0 | 100.0 | 0.0 | 0.0 | 100.0 | 0.0 |
| anthropic | isl_002/v1 | isl_company | substantive | 50.0 | 100.0 | 0.0 | 0.0 | 100.0 | 0.0 |
| openai | isl_002/v3 | isl_company | substantive | 50.0 | 100.0 | 0.0 | 0.0 | 100.0 | 0.0 |
| anthropic | isl_001/v3 | isl_company | substantive | 50.0 | 100.0 | 0.0 | 0.0 | 100.0 | 0.0 |
| anthropic | isl_001/v2 | isl_company | substantive | 50.0 | 100.0 | 0.0 | 0.0 | 100.0 | 0.0 |
| anthropic | isl_001/v1 | isl_company | substantive | 50.0 | 100.0 | 0.0 | 0.0 | 100.0 | 0.0 |
| gemini | isl_001/v3 | isl_company | substantive | 50.0 | 100.0 | 0.0 | 0.0 | 100.0 | 0.0 |
| gemini | isl_002/v2 | isl_company | substantive | 50.0 | 100.0 | 0.0 | 0.0 | 100.0 | 0.0 |
| openai | isl_002/v2 | isl_company | substantive | 50.0 | 100.0 | 0.0 | 0.0 | 100.0 | 0.0 |
| openai | wg_002/v3 | weatherguard | substantive | 48.0 | 0.0 | 100.0 | 40.0 | 100.0 | 0.0 |
| anthropic | wg_002/v3 | weatherguard | substantive | 48.0 | 0.0 | 100.0 | 40.0 | 100.0 | 0.0 |
| anthropic | wg_002/v2 | weatherguard | substantive | 44.0 | 0.0 | 100.0 | 20.0 | 100.0 | 0.0 |
| perplexity | wg_002/v2 | weatherguard | substantive | 41.5 | 0.0 | 100.0 | 0.0 | 100.0 | 30.0 |
| openai | wg_002/v1 | weatherguard | substantive | 40.0 | 0.0 | 100.0 | 0.0 | 100.0 | 0.0 |
| openai | wg_002/v2 | weatherguard | substantive | 40.0 | 0.0 | 100.0 | 0.0 | 100.0 | 0.0 |
| anthropic | wg_002/v1 | weatherguard | substantive | 40.0 | 0.0 | 100.0 | 0.0 | 100.0 | 0.0 |
| perplexity | wg_001/v1 | weatherguard | substantive | 32.5 | 0.0 | 0.0 | 80.0 | 100.0 | 30.0 |
| openai | wg_001/v1 | weatherguard | substantive | 23.0 | 0.0 | 0.0 | 40.0 | 100.0 | 0.0 |
| perplexity | wg_001/v2 | weatherguard | substantive | 20.5 | 0.0 | 0.0 | 20.0 | 100.0 | 30.0 |
| anthropic | wg_001/v2 | weatherguard | substantive | 19.0 | 0.0 | 0.0 | 20.0 | 100.0 | 0.0 |
| openai | wg_001/v3 | weatherguard | substantive | 19.0 | 0.0 | 0.0 | 20.0 | 100.0 | 0.0 |
| gemini | wg_001/v3 | weatherguard | substantive | 19.0 | 0.0 | 0.0 | 20.0 | 100.0 | 0.0 |
| anthropic | wg_001/v3 | weatherguard | substantive | 19.0 | 0.0 | 0.0 | 20.0 | 100.0 | 0.0 |
| anthropic | wg_001/v1 | weatherguard | substantive | 18.5 | 0.0 | 0.0 | 40.0 | 70.0 | 0.0 |
| perplexity | wg_001/v3 | weatherguard | substantive | 16.0 | 0.0 | 0.0 | 20.0 | 70.0 | 30.0 |
| openai | wg_001/v2 | weatherguard | substantive | 15.0 | 0.0 | 0.0 | 0.0 | 100.0 | 0.0 |