PrERT-CNM-v4

Phase 1-2 Progress and Accuracy Dashboard

Snapshot date: 2026-04-05

Purpose

Provide a visual and tabular status view of project progress and model quality indicators through completed Phase 1 and Phase 2 work.

Data Sources

Executive Summary

Area Metric Current Value
Phase 1 Total controls extracted 237
Phase 1 Total chunks generated 239
Phase 1 Chunk inflation vs controls +0.84%
Phase 2 Controls mapped to metrics 237 / 237 (100%)
Phase 2 Synthetic metric observations 711
Phase 2 Public mapping validity (OPP-115) 115 / 115 valid (100%)
Phase 2 Baseline score rows 723

Legend

Figure Table

Figure ID Figure Preview Key Takeaway
Fig 1 Fig 1 GDPR is the largest single control source (103), with ISO and NIST balanced (68/66).
Fig 2 Fig 2 Chunk split closely matches control split, indicating stable chunking behavior.
Fig 3 Fig 3 Organization-level metrics are the largest share (100), then user (82), then system (55).
Fig 4 Fig 4 Most rows are medium risk at baseline (389), with 251 low and 71 high; low up/high down.

Fig 1. Phase 1 Controls by Regulation

Figure 1: Phase 1 Controls by Regulation

What this means:

Fig 2. Phase 1 Chunks by Regulation

Figure 2: Phase 1 Chunks by Regulation

What this means:

Fig 3. Phase 2 Metric Distribution by Level

Figure 3: Phase 2 Metric Levels

What this means:

Fig 4. Phase 2 Risk-Band Distribution (Metric Rows)

Figure 4: Phase 2 Risk Bands

What this means:

Regeneration command:

python scripts/generate_phase12_dashboard_figures.py

Accuracy/Quality Indicator Tables

Notes:

Table A. Coverage and Data Quality Indicators

Indicator Definition Value
Metric coverage mapped_controls / total_controls 237 / 237 (100%)
Missing controls controls with no metric mapping 0
OPP-115 mapped rows public rows ingested into canonical schema 115
OPP-115 valid rows rows passing required fields (event_date, sector, records_affected) 115 / 115 (100%)

Table B. Scenario Stability Indicators

Scenario Composite Compliance (↑) Composite Risk (↓) Risk Band
normal 77.6% (0.776494) 22.4% (0.223506) low
stressed 57.0% (0.569538) 43.0% (0.430462) medium
adversarial 37.3% (0.373206) 62.7% (0.626794) medium

Table C. Average Metric-Level Scores by Scenario

Scenario Mean Confidence-Adjusted Score (↑) Mean Risk Score (↓)
normal 77.6% (0.7757) 22.4% (0.2243)
stressed 57.0% (0.5703) 43% (0.4297)
adversarial 37.3% (0.3705) 62.7% (0.6295)

Interpretation:

Next Measurement Targets

  1. Add held-out policy-clause classification accuracy (Phase 3).
  2. Add calibration and uncertainty reporting (Phase 4).
  3. Track trend lines across runs by appending this dashboard with dated snapshots.

⬅ Back Next ⮕