Summary
section 7.8 requires the harness to BE the gate on the daily path: every daily candidate runs openmed benchmark on golden + public SHIELD in CI, compares to gates/baseline.json, and fails closed (quarantine + open issue/chip, never auto-publish). Rollback is two-phase: candidate staged, gates run against staged artifacts, only a green result flips the manifest pointer to last-green; a regression caught by nightly full-suite or status monitor triggers 'openmed release rollback '. The <10-min/zero-human rollback SLO is measured here. This is the orchestration layer atop the release-gate harness and depends on the manifest, HF publish step, and scheduled CI. Decompose before starting.
Scope
Acceptance criteria
Out of scope
- The gate harness scoring logic (OM-031b, orchestrated here).
- DUA-corpus periodic promotion gate (section 3.3).
Files
- .github/workflows/release-gates.yml
Task: OM-047 · Milestone: v2.0 · Priority: P1 · Size: XL
Depends on: OM-031b, OM-032, OM-024 · Blocks: —
Roadmap: section 7.8
Spec: PLANS/V2/EXECUTION/tasks/OM-047.md
Summary
section 7.8 requires the harness to BE the gate on the daily path: every daily candidate runs openmed benchmark on golden + public SHIELD in CI, compares to gates/baseline.json, and fails closed (quarantine + open issue/chip, never auto-publish). Rollback is two-phase: candidate staged, gates run against staged artifacts, only a green result flips the manifest pointer to last-green; a regression caught by nightly full-suite or status monitor triggers 'openmed release rollback '. The <10-min/zero-human rollback SLO is measured here. This is the orchestration layer atop the release-gate harness and depends on the manifest, HF publish step, and scheduled CI. Decompose before starting.
Scope
Acceptance criteria
Out of scope
Files
Task: OM-047 · Milestone: v2.0 · Priority: P1 · Size: XL
Depends on: OM-031b, OM-032, OM-024 · Blocks: —
Roadmap: section 7.8
Spec: PLANS/V2/EXECUTION/tasks/OM-047.md