PrERT-CNM-v4

Phase 2 Implementation Runbook

This runbook executes Phase 2 in an isolated output folder so Phase 1 code and artifacts remain unchanged.

Scope

Inputs

Required:

Optional:

Using OPP-115

Commands

Create processed OPP-115 flat files (default: consolidation threshold 0.75):

PYTHONPATH=src python scripts/process_opp115_for_phase2.py

Create processed OPP-115 files from the raw annotation set instead of consolidated annotations:

PYTHONPATH=src python scripts/process_opp115_for_phase2.py \
  --input-set annotations

Equivalent package command:

prert-opp115

Run Phase 2 using OPP-115 as a reference corpus only (no --public-input):

PYTHONPATH=src python scripts/run_phase2_metrics.py

Run Phase 2 with a processed OPP-115 CSV export:

PYTHONPATH=src python scripts/run_phase2_metrics.py \
  --public-input data/processed/opp115_public_mapping.csv

Run Phase 2 with a processed OPP-115 JSONL export:

PYTHONPATH=src python scripts/run_phase2_metrics.py \
  --public-input data/processed/opp115_public_mapping.jsonl

Run Phase 2 with processed OPP-115 input and a custom output folder:

PYTHONPATH=src python scripts/run_phase2_metrics.py \
  --public-input data/processed/opp115_public_mapping.csv \
  --output-dir artifacts/phase-2

Outputs

Written to artifacts/phase-2/:

Quality Checks

Notes


⬅ Back Next ⮕