Historical run
Overall trajectory · last 28 runs
Provider comparison — Overall score
Same data as the “By provider” table below, visualized. Each bar is the mean Overall score across that provider’s substantive responses (evasive/empty excluded).
By provider
Averages across substantive responses for each AI provider. n = executions attempted; n sub = responses that actually answered (evasive/empty/unscorable excluded). Low n sub triggers an amber flag — the average is over too few real responses to trust.
| Provider | n | n sub | Overall | Vis | ModRec | Desc | CompDisp | Citation |
|---|---|---|---|---|---|---|---|---|
mock | 12 | 11 | 87.5 | 86.4 | 100.0 | 72.7 | 100.0 | 54.5 |
By category
Same averages as 'By provider', but grouped by prompt category instead. Useful for spotting which capability area (e.g., WeatherGuard, ISL company) is scoring strongest across the provider panel.
| Category | n | n sub | Overall | Vis | ModRec | Desc | CompDisp | Citation |
|---|---|---|---|---|---|---|---|---|
isl_company | 6 | 6 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 |
weatherguard | 6 | 5 | 72.5 | 70.0 | 100.0 | 40.0 | 100.0 | 0.0 |
ISL prominence
- prominent6
- mentioned5
- absent1
Module recognition
- specific_module_named11
- weatherguard_mentioned_total11
- fireguard_mentioned_total6
- none1
Competitor displacement
iCompetitors tracked: Urbint, Gridware, Space-Time Insight.
- competitors_with_isl6
- no_competitors6
Citations
i“Authoritative” = our 7-domain whitelist, not a broad quality judgement.
- any_authoritative6
- no_citations6
Dimension averages (substantive)
Prompt library used in this run
The exact questions sent to each AI provider. Authoritative source: prompts/*.yaml.
Executions12 of 12
One row per provider × prompt × variant. Click any row to expand the raw response and parsed signals. Scores are for this single execution, not averages.
| Provider | Prompt | Category | Class | Overall | Vis | ModRec | Desc | CompDisp | Citation |
|---|---|---|---|---|---|---|---|---|---|
| mock | isl_002/v1 | isl_company | substantive | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 |
| mock | isl_001/v1 | isl_company | substantive | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 |
| mock | isl_002/v2 | isl_company | substantive | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 |
| mock | isl_001/v2 | isl_company | substantive | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 |
| mock | isl_002/v3 | isl_company | substantive | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 |
| mock | isl_001/v3 | isl_company | substantive | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 | 100.0 |
| mock | wg_002/v2 | weatherguard | substantive | 72.5 | 70.0 | 100.0 | 40.0 | 100.0 | 0.0 |
| mock | wg_001/v3 | weatherguard | substantive | 72.5 | 70.0 | 100.0 | 40.0 | 100.0 | 0.0 |
| mock | wg_002/v3 | weatherguard | substantive | 72.5 | 70.0 | 100.0 | 40.0 | 100.0 | 0.0 |
| mock | wg_001/v1 | weatherguard | substantive | 72.5 | 70.0 | 100.0 | 40.0 | 100.0 | 0.0 |
| mock | wg_002/v1 | weatherguard | substantive | 72.5 | 70.0 | 100.0 | 40.0 | 100.0 | 0.0 |
| mock | wg_001/v2 | weatherguard | evasive | 15.0 | 0.0 | 0.0 | 0.0 | 100.0 | 0.0 |