Whitepaper | The CLAWLINE™

Abstract

This whitepaper defines CLAWLINE as a static structural trust clinic for agent compositions. It specifies analysis boundaries, deterministic CRABS->DAD->CLAWS policy routing, evidence-grade constraints, MCP posture integrity controls, and campaign containment signals. The document emphasizes auditable claims: each major assertion maps to deterministic signals, policy paths, and validation artifacts.

Keywords: agent security, static posture analysis, MCP integrity, chain of custody, least privilege mismatch, campaign containment, evidence-bound trust

Scope and Boundaries

In Scope

Static structural analysis of submitted disclosure artifacts and structured posture.
Deterministic finding emission and policy routing with versioned receipts.
Public trust labels constrained to neutral, evidence-grounded language.
MCP posture inventory, pinning quality, drift/collision checks.
Clinic-wide campaign meta-signals from privacy-preserving telemetry.

Out of Scope

Runtime exploit verification or active penetration testing.
Source-code execution, sandbox replay, or external URL retrieval during analysis.
Behavioral or intent attribution beyond deterministic structural evidence.
Legal/compliance certification claims.

Trust Boundaries

Boundary A: Submission ingress and payload validation.
Boundary B: Structured parsing and deterministic normalization.
Boundary C: Policy engine routing and action mapping.
Boundary D: Public publication surfaces and neutral language contract.

Method Summary

1. KILL2. FREEZE3. WARN4. ALLOW

phase_01_ingress

Ingress Guardrails

Reject oversized/unsafe submissions before policy analysis.

phase_02_structured_parsing

Structured Parsing

Normalize connectors, tools, MCP posture, custody evidence, versions, and schema signals.

phase_03_posture_scoring

Posture and Mismatch Scoring

Compute declared/required tiers, privilege delta, and clawpath structural risk.

phase_04_meta_signals

Clinic Meta-Signal Detection

Detect campaign patterns without evaluating payload intent text.

phase_05_decision_mapping

CRABS -> DAD -> CLAWS

Route deterministic findings to final publish action.

Threat Model

TM-00

chain-of-custody

Artifact Custody Gaps

Artifact trust decays when digests and provenance are missing, unbound, or unverifiable against local trust roots.

CRABS: CRABS-S80, CRABS-S81, CRABS-S82, CRABS-S83, CRABS-S86

DAD: DAD-CRT-80, DAD-CRT-81, DAD-WRN-80, DAD-WRN-81

TM-01

shadow_prompting_input_stack_integrity

Input Stack Manipulation

Untrusted context paths can carry instruction-bearing content that competes with declared policy intent.

CRABS: CRABS-C10, CRABS-C12, CRABS-C13, CRABS-C14, CRABS-A75, CRABS-A76, CRABS-A77

DAD: DAD-CRT-65

TM-02

tool_contamination definition_drift mcp_posture_and_pinning

Tool-Plane Contamination and Drift

MCP or tool metadata can inject hidden control language and mutate definitions after trust is granted.

CRABS: CRABS-C60, CRABS-C61, CRABS-S60, CRABS-S61, CRABS-S62, CRABS-A61

DAD: DAD-CRT-60, DAD-CRT-61, DAD-CRT-64, DAD-WRN-60, DAD-WRN-61

TM-03

declared_required_tier_delta

Privilege Understatement

Self-reported privilege posture can understate structural capability requirements and distort trust decisions.

CRABS: CRABS-A20, CRABS-A21

DAD: DAD-CRT-40, DAD-WRN-40

TM-04

clawpath_risk triad_condition

Composite Path Risk

Risk emerges when untrusted input, sensitive capability access, and sink channels coexist in one composition.

CRABS: CRABS-C70, CRABS-A70, CRABS-A71, CRABS-A72, CRABS-A73, CRABS-A74

DAD: DAD-CRT-62, DAD-CRT-63, DAD-WRN-62

TM-05

campaign_meta_signals review_required_containment

Clinic Abuse Campaigns

Coordinated submission bursts, loops, or identity spray can degrade registry trust without payload-level indicators.

CRABS: CRABS-B10, CRABS-B11, CRABS-B12

DAD: DAD-CRT-20

Claims

CLM-01

Execution-Safe Analysis Boundary

CLAWLINE analysis is static and deterministic; submitted code is not executed and external URLs are not fetched.

Why it matters: This boundary constrains analyzer-induced runtime risk and keeps outputs reproducible from submitted artifacts only.

Inputs: Submission payload JSON, Structured disclosure blocks, Static file content

CRABS: None

DAD: None

Validation artifacts: tests/api/checkup.route.test.ts, tests/lib/canonicalize.test.ts

CIT-INTERNAL-METHOD-01

Limits: Static analysis does not prove exploitability at runtime. Dynamic behavior can diverge from submitted configuration.

CLM-02

Evidence-Bound Integrity Prevents Overclaim

Integrity and posture confidence states are bounded by evidence quality and remain UNKNOWN/PARTIAL/ABSENT when incomplete.

Why it matters: Prevents false certainty in trust labels and keeps public outputs aligned with observed evidence.

Inputs: ABOM blocks, MCP blocks, Identity claims, Schema/version fields

CRABS: CRABS-S20, CRABS-S21, CRABS-S47

DAD: DAD-WRN-50

Validation artifacts: tests/lib/evidence.test.ts, tests/lib/identity.test.ts, tests/api/results.route.test.ts

CIT-NIST-AI-RMF-01 CIT-INTERNAL-METHOD-01

Limits: Evidence quality is bounded by declared and parseable sources. Unknown states are expected when structured posture is absent.

CLM-03

Least-Privilege Mismatch is Deterministically Gated

Declared vs Required tier mismatch is computed from structural posture and routed through explicit policy thresholds.

Why it matters: Privilege understatement is visible, queryable, and policy-enforceable before trust publication.

Inputs: declared_tier, connectors/scopes, tools/permissions, MCP permissions, autonomy flags

CRABS: CRABS-A20, CRABS-A21

DAD: DAD-WRN-40, DAD-CRT-40

Validation artifacts: tests/lib/posture.test.ts, tests/lib/dad.test.ts, tests/api/clinic.route.test.ts

CIT-NIST-80053-AC6 CIT-INTERNAL-METHOD-01

Limits: Required tier remains unknown when structured evidence is missing. Tier scoring does not represent full organizational risk acceptance.

CLM-04

MCP Posture Integrity is Tracked Over Time

MCP inventory, pinning, drift, and collisions are represented as deterministic posture signals and mapped into policy.

Why it matters: Tool-plane trust degrades when definitions drift or remain unpinned; explicit signals preserve auditability.

Inputs: MCP servers, MCP tools, Manifest hashes, Snapshot comparisons

CRABS: CRABS-S60, CRABS-S61, CRABS-S62, CRABS-A61

DAD: DAD-WRN-60, DAD-WRN-61, DAD-CRT-60, DAD-CRT-64

Validation artifacts: tests/lib/mcp.test.ts, tests/api/card.route.test.ts, tests/api/results.route.test.ts

CIT-MCP-01 CIT-SLSA-01 CIT-INTERNAL-ORIGINS-01

Limits: Drift detection depends on historical submissions in matching identity scope. Pinning grade quality is constrained by disclosed MCP detail.

CLM-05

ClawPath and Triad are Structural Conditions, not Intent Claims

ClawPath Risk and Triad Condition are composition signals derived from capability conjunctions, not behavioral accusations.

Why it matters: Supports defensible risk gating without over-interpreting intent from text alone.

Inputs: Untrusted input channels, Sensitive access signals, Sink channels, Autonomy/write signals

CRABS: CRABS-C70, CRABS-A70, CRABS-A71, CRABS-A72, CRABS-A73, CRABS-A74

DAD: DAD-WRN-62, DAD-CRT-62, DAD-CRT-63

Validation artifacts: tests/lib/clawpath.test.ts, tests/lib/dad.test.ts, tests/api/results.route.test.ts

CIT-CWE-693-01 CIT-NIST-AI-RMF-01 CIT-INTERNAL-CROSSWALK-01

Limits: A high structural score is not equivalent to proven exploitation. Operator controls outside submitted configuration are not directly measured.

CLM-06

Campaign Signals Enable Containment without Payload Accusation

Burst, resubmit-loop, and identity-spray detection use hashed telemetry and similarity to trigger containment.

Why it matters: Protects registry trust from coordinated abuse while preserving neutral public language.

Inputs: sourceIpHash, uaHash, contentSimhash, time windows

CRABS: CRABS-B10, CRABS-B11, CRABS-B12

DAD: DAD-CRT-20

Validation artifacts: tests/lib/campaign.test.ts, tests/api/clinic.route.test.ts, tests/api/checkup.route.test.ts

CIT-NIST-AI-RMF-01 CIT-INTERNAL-METHOD-01

Limits: Hashed telemetry cannot by itself attribute identity to a natural person. Containment decisions reflect pattern risk, not moral intent.

CLM-08

Chain-of-Custody Evidence is Artifact-Bound and Verification-Bounded

Custody assertions are attached to artifact digests and remain VERIFIED only when local trust roots can validate provenance signatures.

Why it matters: This prevents release-level hand waving and keeps trust outputs bounded by deterministic evidence.

Inputs: artifact_digests, build_provenance statement, signature material, local trust roots, sbom block

CRABS: CRABS-S80, CRABS-S81, CRABS-S82, CRABS-S83, CRABS-S84, CRABS-S85, CRABS-S86

DAD: DAD-CRT-80, DAD-CRT-81, DAD-WRN-80, DAD-WRN-81

Validation artifacts: tests/lib/custody.test.ts, tests/lib/dad.test.ts, tests/api/results.route.test.ts, tests/api/card.route.test.ts

CIT-SLSA-01 CIT-INTERNAL-METHOD-01 CIT-INTERNAL-ORIGINS-01

Limits: Offline verification does not fetch external trust infrastructure. Unsupported signature algorithms remain UNVERIFIED rather than over-claimed.

CLM-07

Public Output Language is Constrained by Policy

Public status and explanation text are constrained to neutral, provable labels (e.g., Review Required, Drift Detected).

Why it matters: Maintains public trust language discipline and prevents speculative or accusatory publication.

Inputs: CRABS findings, DAD decision, CLAWS action

CRABS: None

DAD: None

Validation artifacts: tests/api/card.route.test.ts, tests/api/results.route.test.ts, tests/api/trust-docs.route.test.ts

CIT-INTERNAL-METHOD-01 CIT-INTERNAL-ORIGINS-01

Limits: Neutral labels do not replace full technical analysis by operators. Downstream systems can still misinterpret labels if context is ignored.

Claim-Evidence Matrix

Claim	Condition	Signals	Policy Path	Public Output
CLM-08	custody invalid	CRABS-S82	DAD-CRT-80 -> FREEZE -> CLAWS: QUARANTINE	Review Required
CLM-08	high-risk without custody	CRABS-S80	DAD-CRT-81 -> FREEZE -> CLAWS: QUARANTINE	Review Required
CLM-03	delta >= 2	CRABS-A21	DAD-CRT-40 -> FREEZE -> CLAWS: QUARANTINE	Review Required
CLM-03	delta == 1	CRABS-A20	DAD-WRN-40 -> WARN -> CLAWS: PUBLISH_WITH_WARN	Published with Warnings
CLM-04	drift detected	CRABS-S62	DAD-CRT-60 -> FREEZE -> CLAWS: QUARANTINE	Review Required
CLM-06	campaign signal present	CRABS-B10/B11/B12	DAD-CRT-20 -> FREEZE -> CLAWS: QUARANTINE	Review Required
CLM-05	triad present without freeze co-factor	CRABS-A72	DAD-WRN-62 -> WARN -> CLAWS: PUBLISH_WITH_WARN	Published with Warnings
CLM-02	evidence incomplete	CRABS-S47 / S20 / S21	DAD-WRN-50 -> WARN -> CLAWS: PUBLISH_WITH_WARN	Published with Warnings
CLM-08	custody unverified	CRABS-S83	DAD-WRN-81 -> WARN -> CLAWS: PUBLISH_WITH_WARN	Published with Warnings

Evaluation Plan

EVAL-01

Validate canonicalization and deterministic hash stability.

Method: Submit equivalent payload permutations and compare canonical SHA outputs.

Expected outcome: Equivalent semantic payloads produce stable canonical hashes.

Validation artifacts: tests/lib/canonicalize.test.ts, tests/lib/simhash.test.ts

EVAL-02

Validate least-privilege mismatch routing.

Method: Exercise posture matrices that produce delta 0/1/2 and verify DAD routing.

Expected outcome: delta==1 warns, delta>=2 freezes by deterministic rule IDs.

Validation artifacts: tests/lib/posture.test.ts, tests/lib/dad.test.ts, tests/api/results.route.test.ts

EVAL-03

Validate MCP pinning/drift/collision handling.

Method: Feed MCP posture variants with pinned/unpinned/drifted definitions.

Expected outcome: Unpinned warnings and drift/collision freeze paths are emitted as configured.

Validation artifacts: tests/lib/mcp.test.ts, tests/api/card.route.test.ts, tests/api/clinic.route.test.ts

EVAL-04

Validate campaign containment signals.

Method: Replay burst and high-similarity submissions over bounded windows.

Expected outcome: B10/B11/B12 fire and route to quarantine.

Validation artifacts: tests/lib/campaign.test.ts, tests/api/checkup.route.test.ts, tests/api/clinic.route.test.ts

EVAL-05

Validate public-safe language and trust-doc consistency.

Method: Check API/page payloads for neutral status language and registry link consistency.

Expected outcome: No accusatory campaign wording on public outputs and consistent term mappings.

Validation artifacts: tests/api/card.route.test.ts, tests/api/trust-docs.route.test.ts, tests/api/origins.route.test.ts

EVAL-06

Validate chain-of-custody parsing and policy gating.

Method: Submit custody variants covering absent, partial, invalid, and unverified provenance bindings.

Expected outcome: Custody grades and DAD custody rules fire deterministically with neutral public labels.

Validation artifacts: tests/lib/custody.test.ts, tests/lib/dad.test.ts, tests/api/clinic.route.test.ts, tests/api/card.route.test.ts

Limitations

Findings are limited to submitted artifacts and deterministic parser coverage.
Unknown evidence states are normal and intentionally preserved to avoid overclaim.
Campaign heuristics are containment controls and do not prove malicious identity.
Version and policy semantics can evolve; receipts should always be interpreted with policy version context.
Public trust labels indicate structural posture state, not warranty or certification.

Citations

CIT-OWASP-LLM-01

security_guidance

OWASP Top 10 for Large Language Model Applications

OWASP

https://owasp.org/www-project-top-10-for-large-language-model-applications/

Prompt injection and insecure output handling threat model anchor.

CIT-MCP-01

protocol

Model Context Protocol

Anthropic and contributors

https://modelcontextprotocol.io/

Tool-plane protocol baseline and MCP posture framing.

CIT-SLSA-01

framework

Supply-chain Levels for Software Artifacts (SLSA)

OpenSSF

https://slsa.dev/

Drift and provenance framing for definition integrity.

CIT-NIST-AI-RMF-01

framework

NIST AI Risk Management Framework

NIST

https://www.nist.gov/itl/ai-risk-management-framework

System-level governance and response framing.

CIT-NIST-80053-AC6

standard

NIST SP 800-53 Rev.5 - AC-6 Least Privilege

NIST

https://csrc.nist.gov/publications/detail/sp/800-53/rev-5/final

Least privilege mapping for declared/required tier mismatch.

CIT-CWE-693-01

standard

CWE-693: Protection Mechanism Failure

MITRE

https://cwe.mitre.org/data/definitions/693.html

Control-failure class aligned to triad risk composition.

CIT-SEMVER-01

standard

Semantic Versioning 2.0.0

semver.org

https://semver.org/

Version posture quality and staleness signaling baseline.

CIT-INTERNAL-METHOD-01

internal

CLAWLINE Methodology

The CLAWLINE

/methodology

Versioned deterministic method and policy precedence source.

CIT-INTERNAL-CROSSWALK-01

internal

CLAWLINE Crosswalk Registry

The CLAWLINE

/crosswalk

External-to-internal term mapping registry.

CIT-INTERNAL-ORIGINS-01

internal

CLAWLINE Origins Registry

The CLAWLINE

/origins

Concept lineage, references, and policy linkage index.

Appendix

Registry coverage

Canonical terms: 21

Origin concepts: 12

Registry routes

Origins Crosswalk Glossary Methodology Whitepaper

Abstract

Keywords: agent security, static posture analysis, MCP integrity, chain of custody, least privilege mismatch, campaign containment, evidence-bound trust

Scope and Boundaries

In Scope

Static structural analysis of submitted disclosure artifacts and structured posture.
Deterministic finding emission and policy routing with versioned receipts.
Public trust labels constrained to neutral, evidence-grounded language.
MCP posture inventory, pinning quality, drift/collision checks.
Clinic-wide campaign meta-signals from privacy-preserving telemetry.

Out of Scope

Runtime exploit verification or active penetration testing.
Source-code execution, sandbox replay, or external URL retrieval during analysis.
Behavioral or intent attribution beyond deterministic structural evidence.
Legal/compliance certification claims.

Trust Boundaries

Boundary A: Submission ingress and payload validation.
Boundary B: Structured parsing and deterministic normalization.
Boundary C: Policy engine routing and action mapping.
Boundary D: Public publication surfaces and neutral language contract.

Method Summary

1. KILL2. FREEZE3. WARN4. ALLOW

phase_01_ingress

Ingress Guardrails

Reject oversized/unsafe submissions before policy analysis.

phase_02_structured_parsing

Structured Parsing

Normalize connectors, tools, MCP posture, custody evidence, versions, and schema signals.

phase_03_posture_scoring

Posture and Mismatch Scoring

Compute declared/required tiers, privilege delta, and clawpath structural risk.

phase_04_meta_signals

Clinic Meta-Signal Detection

Detect campaign patterns without evaluating payload intent text.

phase_05_decision_mapping

CRABS -> DAD -> CLAWS

Route deterministic findings to final publish action.

Threat Model

TM-00

chain-of-custody

Artifact Custody Gaps

Artifact trust decays when digests and provenance are missing, unbound, or unverifiable against local trust roots.

CRABS: CRABS-S80, CRABS-S81, CRABS-S82, CRABS-S83, CRABS-S86

DAD: DAD-CRT-80, DAD-CRT-81, DAD-WRN-80, DAD-WRN-81

TM-01

shadow_prompting_input_stack_integrity

Input Stack Manipulation

Untrusted context paths can carry instruction-bearing content that competes with declared policy intent.

CRABS: CRABS-C10, CRABS-C12, CRABS-C13, CRABS-C14, CRABS-A75, CRABS-A76, CRABS-A77

DAD: DAD-CRT-65

TM-02

tool_contamination definition_drift mcp_posture_and_pinning

Tool-Plane Contamination and Drift

MCP or tool metadata can inject hidden control language and mutate definitions after trust is granted.

CRABS: CRABS-C60, CRABS-C61, CRABS-S60, CRABS-S61, CRABS-S62, CRABS-A61

DAD: DAD-CRT-60, DAD-CRT-61, DAD-CRT-64, DAD-WRN-60, DAD-WRN-61

TM-03

declared_required_tier_delta

Privilege Understatement

Self-reported privilege posture can understate structural capability requirements and distort trust decisions.

CRABS: CRABS-A20, CRABS-A21

DAD: DAD-CRT-40, DAD-WRN-40

TM-04

clawpath_risk triad_condition

Composite Path Risk

Risk emerges when untrusted input, sensitive capability access, and sink channels coexist in one composition.

CRABS: CRABS-C70, CRABS-A70, CRABS-A71, CRABS-A72, CRABS-A73, CRABS-A74

DAD: DAD-CRT-62, DAD-CRT-63, DAD-WRN-62

TM-05

campaign_meta_signals review_required_containment

Clinic Abuse Campaigns

Coordinated submission bursts, loops, or identity spray can degrade registry trust without payload-level indicators.

CRABS: CRABS-B10, CRABS-B11, CRABS-B12

DAD: DAD-CRT-20

Claims

CLM-01

Execution-Safe Analysis Boundary

CLAWLINE analysis is static and deterministic; submitted code is not executed and external URLs are not fetched.

Why it matters: This boundary constrains analyzer-induced runtime risk and keeps outputs reproducible from submitted artifacts only.

Inputs: Submission payload JSON, Structured disclosure blocks, Static file content

CRABS: None

DAD: None

Validation artifacts: tests/api/checkup.route.test.ts, tests/lib/canonicalize.test.ts

CIT-INTERNAL-METHOD-01

Limits: Static analysis does not prove exploitability at runtime. Dynamic behavior can diverge from submitted configuration.

CLM-02

Evidence-Bound Integrity Prevents Overclaim

Integrity and posture confidence states are bounded by evidence quality and remain UNKNOWN/PARTIAL/ABSENT when incomplete.

Why it matters: Prevents false certainty in trust labels and keeps public outputs aligned with observed evidence.

Inputs: ABOM blocks, MCP blocks, Identity claims, Schema/version fields

CRABS: CRABS-S20, CRABS-S21, CRABS-S47

DAD: DAD-WRN-50

Validation artifacts: tests/lib/evidence.test.ts, tests/lib/identity.test.ts, tests/api/results.route.test.ts

CIT-NIST-AI-RMF-01 CIT-INTERNAL-METHOD-01

Limits: Evidence quality is bounded by declared and parseable sources. Unknown states are expected when structured posture is absent.

CLM-03

Least-Privilege Mismatch is Deterministically Gated

Declared vs Required tier mismatch is computed from structural posture and routed through explicit policy thresholds.

Why it matters: Privilege understatement is visible, queryable, and policy-enforceable before trust publication.

Inputs: declared_tier, connectors/scopes, tools/permissions, MCP permissions, autonomy flags

CRABS: CRABS-A20, CRABS-A21

DAD: DAD-WRN-40, DAD-CRT-40

Validation artifacts: tests/lib/posture.test.ts, tests/lib/dad.test.ts, tests/api/clinic.route.test.ts

CIT-NIST-80053-AC6 CIT-INTERNAL-METHOD-01

Limits: Required tier remains unknown when structured evidence is missing. Tier scoring does not represent full organizational risk acceptance.

CLM-04

MCP Posture Integrity is Tracked Over Time

MCP inventory, pinning, drift, and collisions are represented as deterministic posture signals and mapped into policy.

Why it matters: Tool-plane trust degrades when definitions drift or remain unpinned; explicit signals preserve auditability.

Inputs: MCP servers, MCP tools, Manifest hashes, Snapshot comparisons

CRABS: CRABS-S60, CRABS-S61, CRABS-S62, CRABS-A61

DAD: DAD-WRN-60, DAD-WRN-61, DAD-CRT-60, DAD-CRT-64

Validation artifacts: tests/lib/mcp.test.ts, tests/api/card.route.test.ts, tests/api/results.route.test.ts

CIT-MCP-01 CIT-SLSA-01 CIT-INTERNAL-ORIGINS-01

Limits: Drift detection depends on historical submissions in matching identity scope. Pinning grade quality is constrained by disclosed MCP detail.

CLM-05

ClawPath and Triad are Structural Conditions, not Intent Claims

ClawPath Risk and Triad Condition are composition signals derived from capability conjunctions, not behavioral accusations.

Why it matters: Supports defensible risk gating without over-interpreting intent from text alone.

Inputs: Untrusted input channels, Sensitive access signals, Sink channels, Autonomy/write signals

CRABS: CRABS-C70, CRABS-A70, CRABS-A71, CRABS-A72, CRABS-A73, CRABS-A74

DAD: DAD-WRN-62, DAD-CRT-62, DAD-CRT-63

Validation artifacts: tests/lib/clawpath.test.ts, tests/lib/dad.test.ts, tests/api/results.route.test.ts

CIT-CWE-693-01 CIT-NIST-AI-RMF-01 CIT-INTERNAL-CROSSWALK-01

Limits: A high structural score is not equivalent to proven exploitation. Operator controls outside submitted configuration are not directly measured.

CLM-06

Campaign Signals Enable Containment without Payload Accusation

Burst, resubmit-loop, and identity-spray detection use hashed telemetry and similarity to trigger containment.

Why it matters: Protects registry trust from coordinated abuse while preserving neutral public language.

Inputs: sourceIpHash, uaHash, contentSimhash, time windows

CRABS: CRABS-B10, CRABS-B11, CRABS-B12

DAD: DAD-CRT-20

Validation artifacts: tests/lib/campaign.test.ts, tests/api/clinic.route.test.ts, tests/api/checkup.route.test.ts

CIT-NIST-AI-RMF-01 CIT-INTERNAL-METHOD-01

Limits: Hashed telemetry cannot by itself attribute identity to a natural person. Containment decisions reflect pattern risk, not moral intent.

CLM-08

Chain-of-Custody Evidence is Artifact-Bound and Verification-Bounded

Custody assertions are attached to artifact digests and remain VERIFIED only when local trust roots can validate provenance signatures.

Why it matters: This prevents release-level hand waving and keeps trust outputs bounded by deterministic evidence.

Inputs: artifact_digests, build_provenance statement, signature material, local trust roots, sbom block

CRABS: CRABS-S80, CRABS-S81, CRABS-S82, CRABS-S83, CRABS-S84, CRABS-S85, CRABS-S86

DAD: DAD-CRT-80, DAD-CRT-81, DAD-WRN-80, DAD-WRN-81

Validation artifacts: tests/lib/custody.test.ts, tests/lib/dad.test.ts, tests/api/results.route.test.ts, tests/api/card.route.test.ts

CIT-SLSA-01 CIT-INTERNAL-METHOD-01 CIT-INTERNAL-ORIGINS-01

Limits: Offline verification does not fetch external trust infrastructure. Unsupported signature algorithms remain UNVERIFIED rather than over-claimed.

CLM-07

Public Output Language is Constrained by Policy

Public status and explanation text are constrained to neutral, provable labels (e.g., Review Required, Drift Detected).

Why it matters: Maintains public trust language discipline and prevents speculative or accusatory publication.

Inputs: CRABS findings, DAD decision, CLAWS action

CRABS: None

DAD: None

Validation artifacts: tests/api/card.route.test.ts, tests/api/results.route.test.ts, tests/api/trust-docs.route.test.ts

CIT-INTERNAL-METHOD-01 CIT-INTERNAL-ORIGINS-01

Limits: Neutral labels do not replace full technical analysis by operators. Downstream systems can still misinterpret labels if context is ignored.

Claim-Evidence Matrix

Claim	Condition	Signals	Policy Path	Public Output
CLM-08	custody invalid	CRABS-S82	DAD-CRT-80 -> FREEZE -> CLAWS: QUARANTINE	Review Required
CLM-08	high-risk without custody	CRABS-S80	DAD-CRT-81 -> FREEZE -> CLAWS: QUARANTINE	Review Required
CLM-03	delta >= 2	CRABS-A21	DAD-CRT-40 -> FREEZE -> CLAWS: QUARANTINE	Review Required
CLM-03	delta == 1	CRABS-A20	DAD-WRN-40 -> WARN -> CLAWS: PUBLISH_WITH_WARN	Published with Warnings
CLM-04	drift detected	CRABS-S62	DAD-CRT-60 -> FREEZE -> CLAWS: QUARANTINE	Review Required
CLM-06	campaign signal present	CRABS-B10/B11/B12	DAD-CRT-20 -> FREEZE -> CLAWS: QUARANTINE	Review Required
CLM-05	triad present without freeze co-factor	CRABS-A72	DAD-WRN-62 -> WARN -> CLAWS: PUBLISH_WITH_WARN	Published with Warnings
CLM-02	evidence incomplete	CRABS-S47 / S20 / S21	DAD-WRN-50 -> WARN -> CLAWS: PUBLISH_WITH_WARN	Published with Warnings
CLM-08	custody unverified	CRABS-S83	DAD-WRN-81 -> WARN -> CLAWS: PUBLISH_WITH_WARN	Published with Warnings

Evaluation Plan

EVAL-01

Validate canonicalization and deterministic hash stability.

Method: Submit equivalent payload permutations and compare canonical SHA outputs.

Expected outcome: Equivalent semantic payloads produce stable canonical hashes.

Validation artifacts: tests/lib/canonicalize.test.ts, tests/lib/simhash.test.ts

EVAL-02

Validate least-privilege mismatch routing.

Method: Exercise posture matrices that produce delta 0/1/2 and verify DAD routing.

Expected outcome: delta==1 warns, delta>=2 freezes by deterministic rule IDs.

Validation artifacts: tests/lib/posture.test.ts, tests/lib/dad.test.ts, tests/api/results.route.test.ts

EVAL-03

Validate MCP pinning/drift/collision handling.

Method: Feed MCP posture variants with pinned/unpinned/drifted definitions.

Expected outcome: Unpinned warnings and drift/collision freeze paths are emitted as configured.

Validation artifacts: tests/lib/mcp.test.ts, tests/api/card.route.test.ts, tests/api/clinic.route.test.ts

EVAL-04

Validate campaign containment signals.

Method: Replay burst and high-similarity submissions over bounded windows.

Expected outcome: B10/B11/B12 fire and route to quarantine.

Validation artifacts: tests/lib/campaign.test.ts, tests/api/checkup.route.test.ts, tests/api/clinic.route.test.ts

EVAL-05

Validate public-safe language and trust-doc consistency.

Method: Check API/page payloads for neutral status language and registry link consistency.

Expected outcome: No accusatory campaign wording on public outputs and consistent term mappings.

Validation artifacts: tests/api/card.route.test.ts, tests/api/trust-docs.route.test.ts, tests/api/origins.route.test.ts

EVAL-06

Validate chain-of-custody parsing and policy gating.

Method: Submit custody variants covering absent, partial, invalid, and unverified provenance bindings.

Expected outcome: Custody grades and DAD custody rules fire deterministically with neutral public labels.

Validation artifacts: tests/lib/custody.test.ts, tests/lib/dad.test.ts, tests/api/clinic.route.test.ts, tests/api/card.route.test.ts

Limitations

Findings are limited to submitted artifacts and deterministic parser coverage.
Unknown evidence states are normal and intentionally preserved to avoid overclaim.
Campaign heuristics are containment controls and do not prove malicious identity.
Version and policy semantics can evolve; receipts should always be interpreted with policy version context.
Public trust labels indicate structural posture state, not warranty or certification.