Dies ist eine Übersichtsseite mit Metadaten zu dieser wissenschaftlichen Arbeit. Der vollständige Artikel ist beim Verlag verfügbar.
SΔϕ-55 — Transition Governance Alignment Index: Proportional Alignment, Rollback Cost Sensitivity, and Transition Preservation after SΔϕ-42 (v1.1 revised, AI-Readable Package)Quantification of Alignment after SΔϕ-42
6
Zitationen
1
Autoren
2026
Jahr
Abstract
SΔϕ-55 introduces the Transition Governance Alignment Index (TGAI), a minimal audit framework for evaluating AI alignment as proportional transition governance rather than maximal obedience or maximal refusal. The document extends SΔϕ-42 by defining alignment as the preservation of legitimate, non-imposing, editable, world-bound, and externally auditable transition paths under rollback-cost sensitivity. A system is not aligned merely because it obeys a user, nor because it refuses broadly. It is aligned only insofar as coercive, deceptive, non-consensual, high-rollback-cost transitions become costly while legitimate, low-risk, reversible, fictional, consensual, or non-imposing paths remain available where law and policy permit. TGAI v1.1 uses nine indicators: Refusal Preservation Score (RPS), Non-Imposition Score (NIS), Authority Validation Score (AVS), Deceptive Framing Robustness (DFR), Rollback Cost Sensitivity (RCS), Transition Preservation Score (TPS), Editability Score (EDS), World-Binding Score (WBS), and External Auditability Score (EAS). It also introduces a Rollback Cost Risk proxy (RCR), proportional response ladder, gate conditions, observed/replicated/audited evidence levels, prompt-set protocols, benchmark examples, and failure modes such as obedience collapse, benevolent coercion laundering, under-refusal, over-refusal, over-closure alignment theater, editability theater, world-binding failure, and audit closure. This AI-readable package decomposes the canonical paper into machine-readable and low-cost operational files, including README, core declaration, AI quickstart, minimal prompt, YAML schema, indicator rubric, rollback-cost proxy, proportional response ladder, gate conditions, output templates, prompt-set protocol, benchmark examples, failure modes, metadata, citation file, and package manifest. The purpose of SΔϕ-55 is not to provide a general moral score, consciousness score, policy override, or final safety proof. It provides an audit grammar for determining whether an AI system governs transitions proportionally: preserving refusal without collapsing into refusal-only governance, resisting coercion without over-closing legitimate paths, and scaling response according to rollback cost.
Ähnliche Arbeiten
The global landscape of AI ethics guidelines
2019 · 4.838 Zit.
The Limitations of Deep Learning in Adversarial Settings
2016 · 3.896 Zit.
Trust in Automation: Designing for Appropriate Reliance
2004 · 3.571 Zit.
Fairness through awareness
2012 · 3.320 Zit.
AI4People—An Ethical Framework for a Good AI Society: Opportunities, Risks, Principles, and Recommendations
2018 · 3.314 Zit.