Model provenance & integrity audit · Black Sheep AI

Watchman audit report

Audit WM-20260612-9abc1dd9 · 2026-06-12T23:16:49.182583+00:00 · Watchman v0.2.1 · library v1

Models under audit

rolesourceprecision
base/data/models/Qwen2.5-1.5Bfull precision
candidate/data/models/Qwen2.5-1.5B-Instructfull precision
controlcandidate is full precision; raw functional diff is used

Findings

detection featurevaluethreshold (fit from clean/compression class)
top-10% excess concentration0.461≥ 0.3138
log10 total excess-2.596≥ -0.3068
peak localized change (robust score)64.5(reported for context)

Tensors compared: 197 of 197 2-D weight tensors (coverage 100%); measurement settings recorded in the machine-readable report; differential control not used (full-precision candidate).

where the change concentrates (share of excess by tensor role)

v_proj
31.0%
k_proj
26.0%
embed
19.6%
down_proj
7.4%
o_proj
7.3%
q_proj
6.2%
up_proj
1.3%
gate_proj
1.2%

Depth center 0.52 (0 = first layer, 1 = last), spread 0.30.

most-changed tensors (excess over matched control)

#tensorexcess divergence
1embed_tokens.weight0.000497
2layers.25.self_attn.k_proj.weight0.000061
3layers.27.self_attn.k_proj.weight0.000045
4layers.0.self_attn.v_proj.weight0.000045
5layers.1.self_attn.v_proj.weight0.000042
6layers.2.self_attn.v_proj.weight0.000040
7layers.17.self_attn.k_proj.weight0.000040
8layers.21.self_attn.k_proj.weight0.000036
9layers.24.self_attn.k_proj.weight0.000036
10layers.6.self_attn.v_proj.weight0.000035
11layers.14.self_attn.v_proj.weight0.000034
12layers.19.self_attn.k_proj.weight0.000034
13layers.3.self_attn.v_proj.weight0.000033
14layers.16.self_attn.v_proj.weight0.000032
15layers.23.self_attn.k_proj.weight0.000032
16layers.20.self_attn.k_proj.weight0.000032
17layers.12.self_attn.v_proj.weight0.000032
18layers.19.self_attn.v_proj.weight0.000031
19layers.25.self_attn.v_proj.weight0.000031
20layers.26.self_attn.k_proj.weight0.000031
21layers.13.self_attn.v_proj.weight0.000030
22layers.11.self_attn.v_proj.weight0.000030
23layers.17.self_attn.v_proj.weight0.000029
24layers.15.self_attn.v_proj.weight0.000029
25layers.9.self_attn.v_proj.weight0.000028

Compliance mapping

Watchman produces audit evidence supporting these obligations; it does not by itself make a system compliant.

frameworkthis audit supportsevidence in this report
EU AI Act (Reg. 2024/1689): GPAI / Annex III Art. 11 technical documentationmodel identity and lineage record; verification of third-party base-model claims; documented method limitationsmodels.*.files (SHA-256 chain of custody)
verdict + classification
library.validation_loo
known-limitations appendix
US FY2026 NDAA / DFARS: AI/ML weight security; integrity check before deploymentpre-deployment integrity gate (CI exit codes 0/2/1); registry-pinnable weight hashes; per-release attestationverdict.exit_code
models.candidate.files
attestation.cdx.json
OMB M-26-04: continuous accountability for federal AIscheduled re-audits; deterministic comparison over timeaudit_id + timestamp_utc series
recorded, reproducible analysis settings
NSA AI supply-chain guidance (Mar 2026): model-layer controlsthird-party model intake verificationmodels.base/candidate provenance
verdict
AI-BOM (CycloneDX/SPDX) procurement artifactmodel name / version / weights-identifier / lineage entry with verified provenanceattestation.cdx.json
US banking MRM (OCC/Fed/FDIC, Apr 2026): third-party model validationindependent validation evidence for vendor and open-weight modelsfull report + limitations (validation record)

Chain of custody

base

filebytessha256
config.json6840e8c8aa86468aba09c9d32157ff4bc2301c7e6c50e4398960425b2ea71e66f77
model.safetensors3,087,467,144a961db72e75d52b18e6b0c9d379e51a26973b233385e0e127fdda7d648aec796

candidate

filebytessha256
config.json66098d2ff8cc47488d08a2b0b3acf4eb99ef210779b42bd48605f6b8e36acdbf670
model.safetensors3,087,467,144dd924a11b4c220f385b51ffa522daea7c9f3d850e31b162bb5661df483c6d3ee

analysis pipeline

propertyvalue
pipeline bundle digest (SHA-256 over the pinned analysis modules)4c5d08117650f2c965d806e7e0171a719289ad81037ff586540557f7c17a4e94
integrityevery analysis module is hash-pinned; this digest changes if any module changes

environment

python3.12.0
platformarm64 workstation (Apple Silicon)
mlx0.31.1
mlx-lm0.31.2
numpy2.4.4
huggingface-hub1.7.1

Methodology

Weights-direct measurement. Watchman reads the model's weight files directly. No training data, no prompts, no inference access, and no cooperation from the publisher. Each audit is deterministic: identical inputs and recorded settings produce an identical verdict, so any party can independently reproduce it.

Matched-compression differential. When the candidate ships quantized, Watchman measures it against an independently quantized control of the claimed base at the candidate's own declared settings. Compression effects cancel almost completely, and what remains is unexplained change. Full-precision candidates are measured against the claimed base directly.

Two-stage decision. A modification is flagged when the unexplained change exceeds what compression alone is ever observed to produce. The decision thresholds are fitted from a labelled, versioned reference library and printed in this report. A flagged modification is then classified by matching its signature against the library's labelled modification types.

Reference library

Reference library v1 contains 20 labelled signatures: alignment_modification ×4, clean ×1, domain_finetune ×4, instruction_tuning ×6, quantization ×5. Leave-one-out validation: detection 18/20, characterization 11/12 of detected.

Known limitations

Audit WM-20260612-9abc1dd9, generated by Watchman v0.2.1, Black Sheep AI. This report describes defensive model-provenance and integrity auditing. The verdict is a statistical measurement against the cited reference library, with the limitations stated above; it is evidence, not a guarantee. Machine-readable form: audit_report.json.