Appendix D: Risk Scoring Matrix
A comprehensive methodology for assessing AI system risks based on probability of occurrence and severity of impact, aligned with enterprise risk management frameworks and EU AI Act risk classifications.
D.1 Introduction to AI Risk Scoring
The Risk Scoring Matrix provides a standardized methodology for evaluating and prioritizing AI-related risks across the organization. This framework enables consistent risk assessment during the AI lifecycle, from initial ideation through deployment and monitoring.
Purpose of Risk Scoring
- Prioritization: Determine which risks require immediate attention vs. monitoring
- Resource Allocation: Guide investment in risk mitigation based on severity
- Governance Triggers: Define escalation thresholds for RAI Council review
- Regulatory Alignment: Map internal risk levels to EU AI Act classifications
- Communication: Provide a common language for discussing risk across stakeholders
📐 Core Risk Calculation Formula
Where:
- Likelihood (L): Probability of the risk event occurring (1-5)
- Impact (I): Severity of consequences if the risk materializes (1-5)
- Detectability Factor (D): Ability to detect the risk before harm occurs (0.5-1.5)
This produces a composite score ranging from 0.5 to 37.5, which is then mapped to risk levels.
D.2 The 5×5 Risk Matrix
The foundational risk matrix maps Likelihood against Impact to produce a base risk score. This is the primary tool for initial risk classification.
| LIKELIHOOD ↓ / IMPACT → |
IMPACT SEVERITY | |||||
|---|---|---|---|---|---|---|
| 1 - Negligible | 2 - Minor | 3 - Moderate | 4 - Major | 5 - Catastrophic | ||
| 5 | Almost Certain (>90%) |
5 MEDIUM |
10 HIGH |
15 HIGH |
20 CRITICAL |
25 CRITICAL |
| 4 | Likely (70-90%) |
4 LOW |
8 MEDIUM |
12 HIGH |
16 HIGH |
20 CRITICAL |
| 3 | Possible (30-70%) |
3 LOW |
6 MEDIUM |
9 MEDIUM |
12 HIGH |
15 HIGH |
| 2 | Unlikely (10-30%) |
2 MINIMAL |
4 LOW |
6 MEDIUM |
8 MEDIUM |
10 HIGH |
| 1 | Rare (<10%) |
1 MINIMAL |
2 MINIMAL |
3 LOW |
4 LOW |
5 MEDIUM |
Risk Level Definitions
| CRITICAL | Score: 20-25 — Unacceptable risk requiring immediate action. May trigger "stop the line" authority. Requires executive approval to proceed. |
|---|---|
| HIGH | Score: 10-19 — Significant risk requiring active management and mitigation. RAI Council review mandatory. Enhanced monitoring required. |
| MEDIUM | Score: 5-9 — Moderate risk requiring documented mitigation plans and regular monitoring. Standard governance processes apply. |
| LOW | Score: 3-4 — Low risk that can be managed through standard procedures. Periodic review recommended. |
| MINIMAL | Score: 1-2 — Negligible risk requiring no special action. Document and continue normal operations. |
D.3 Likelihood Assessment Criteria
Use the following criteria to assess the probability that a specific AI risk will materialize:
Level 5: Almost Certain (>90% probability)
| Indicators |
|
|---|---|
| AI Examples |
|
Level 4: Likely (70-90% probability)
| Indicators |
|
|---|---|
| AI Examples |
|
Level 3: Possible (30-70% probability)
| Indicators |
|
|---|---|
| AI Examples |
|
Level 2: Unlikely (10-30% probability)
| Indicators |
|
|---|---|
| AI Examples |
|
Level 1: Rare (<10% probability)
| Indicators |
|
|---|---|
| AI Examples |
|
D.4 Impact Assessment Criteria
Assess impact across multiple dimensions, using the highest applicable severity level:
Level 5: Catastrophic Impact
| Financial | >$10M direct loss, or material impact to market value |
|---|---|
| Operational | Complete business unit shutdown, >30 days to recover critical systems |
| Reputational | International media coverage, sustained public outrage, executive resignation |
| Regulatory | Criminal prosecution, license revocation, regulatory shutdown order |
| Safety/Rights | Loss of life, permanent disability, mass civil rights violations |
| AI Examples |
|
Level 4: Major Impact
| Financial | $1M-$10M direct loss, significant profit impact |
|---|---|
| Operational | Major process disruption, 7-30 days recovery, significant resource diversion |
| Reputational | National media coverage, customer trust significantly damaged, partner concerns |
| Regulatory | Significant fines, formal enforcement action, mandatory remediation |
| Safety/Rights | Serious injury, significant civil rights violation, documented harm to individuals |
| AI Examples |
|
Level 3: Moderate Impact
| Financial | $100K-$1M direct loss, noticeable budget impact |
|---|---|
| Operational | Process degradation, 1-7 days recovery, overtime required |
| Reputational | Trade press coverage, customer complaints, internal morale impact |
| Regulatory | Formal warnings, compliance audit triggered, documentation required |
| Safety/Rights | Minor injury, individual rights violation, distress caused |
| AI Examples |
|
Level 2: Minor Impact
| Financial | $10K-$100K direct loss, within operational budget |
|---|---|
| Operational | Temporary inconvenience, <24 hours recovery, workarounds available |
| Reputational | Limited external awareness, some customer complaints, quickly resolved |
| Regulatory | Informal inquiry, self-reported issue, no formal action |
| Safety/Rights | No physical harm, inconvenience or frustration, easily remediated |
| AI Examples |
|
Level 1: Negligible Impact
| Financial | <$10K direct loss, absorbed in normal operations |
|---|---|
| Operational | Minimal disruption, immediate recovery, no workarounds needed |
| Reputational | No external awareness, internal only, quickly forgotten |
| Regulatory | No regulatory implications |
| Safety/Rights | No harm, no complaints |
| AI Examples |
|
D.5 Detectability Factor (Risk Modifier)
The Detectability Factor adjusts the base risk score based on how quickly and reliably the risk can be detected before significant harm occurs.
| Factor | Detectability Level | Description | AI Examples |
|---|---|---|---|
| 1.5× | Undetectable | No mechanism to detect the risk before harm occurs. Discovery only after significant damage. | Subtle bias emerging over time; slow model drift; privacy violations with no audit trail |
| 1.25× | Difficult to Detect | Detection possible but requires specialized effort or occurs after partial harm. | Complex adversarial attacks; sophisticated prompt injection; data poisoning in training |
| 1.0× | Moderately Detectable | Standard monitoring will likely detect the risk, but not immediately. | Performance degradation; obvious hallucinations; standard security events |
| 0.75× | Easily Detectable | Robust monitoring and alerting systems provide early warning. | Input validation failures; rate limit violations; known attack signatures |
| 0.5× | Immediately Detectable | Real-time detection with automatic response. Risk is caught before any harm. | Hard-coded guardrails blocking prohibited content; circuit breakers; kill switches |
📊 Adjusted Risk Score Calculation
Example: A risk with Likelihood=4, Impact=3, and Difficult Detectability:
The same risk with robust detection (Factor=0.75):
D.6 Risk Response Actions by Level
| Risk Level | Response Actions | Governance Requirements | Timeline |
|---|---|---|---|
| CRITICAL |
|
|
Immediate action within 24 hours |
| HIGH |
|
|
Mitigation plan within 7 days |
| MEDIUM |
|
|
Mitigation plan within 30 days |
| LOW |
|
|
Address in normal project cycle |
| MINIMAL |
|
|
No deadline |
D.7 AI-Specific Risk Categories
When applying the risk matrix, consider these AI-specific risk categories:
🎯 Fairness & Discrimination Risks
| Risk Type |
|
|---|---|
| Key Factors |
|
🔒 Security & Adversarial Risks
| Risk Type |
|
|---|---|
| Key Factors |
|
⚡ Reliability & Performance Risks
| Risk Type |
|
|---|---|
| Key Factors |
|
🔐 Privacy & Data Protection Risks
| Risk Type |
|
|---|---|
| Key Factors |
|
📋 Compliance & Legal Risks
| Risk Type |
|
|---|---|
| Key Factors |
|
D.8 Mapping to EU AI Act Risk Categories
The internal risk scoring framework aligns with EU AI Act classifications:
| Internal Rating | EU AI Act Category | Regulatory Requirements |
|---|---|---|
| CRITICAL (20-25) | Prohibited or High-Risk (Annex III) |
|
| HIGH (10-19) | High-Risk (Annex III) |
|
| MEDIUM (5-9) | Limited Risk |
|
| LOW (3-4) | Minimal Risk |
|
| MINIMAL (1-2) | Minimal Risk |
|
D.9 Worked Examples
Example 1: AI Resume Screening Tool
Context: Automated resume screening for technical positions, affects hiring decisions.
| Factor | Assessment | Score |
|---|---|---|
| Likelihood | Likely (4) - Historical data shows gender and ethnic bias in resume screening is common without mitigation | 4 |
| Impact | Major (4) - Employment discrimination has significant legal, reputational, and individual harm potential | 4 |
| Base Score | 4 × 4 = 16 | |
| Detectability | Difficult to Detect (1.25) - Bias may emerge gradually and require specialized analysis | 1.25× |
| Final Score | 16 × 1.25 = 20 → CRITICAL RISK | |
Required Actions: RAI Council review, comprehensive bias testing across all protected characteristics, mandatory human review of AI decisions, continuous fairness monitoring, EU AI Act high-risk compliance.
Example 2: Customer Service Chatbot
Context: LLM-powered chatbot for general customer inquiries, no financial transactions.
| Factor | Assessment | Score |
|---|---|---|
| Likelihood | Possible (3) - LLMs can occasionally provide incorrect information | 3 |
| Impact | Minor (2) - Customer inconvenience, easily corrected, no financial impact | 2 |
| Base Score | 3 × 2 = 6 | |
| Detectability | Easily Detectable (0.75) - Customer feedback, conversation logs, quality sampling | 0.75× |
| Final Score | 6 × 0.75 = 4.5 → LOW RISK | |
Required Actions: Standard guardrails, AI disclosure to users, escalation path to human agents, periodic quality review.
Example 3: Medical Diagnosis Support
Context: AI system providing diagnostic suggestions to physicians for rare diseases.
| Factor | Assessment | Score |
|---|---|---|
| Likelihood | Possible (3) - Rare disease diagnosis is inherently difficult | 3 |
| Impact | Catastrophic (5) - Incorrect diagnosis could lead to patient harm or death | 5 |
| Base Score | 3 × 5 = 15 | |
| Detectability | Moderately Detectable (1.0) - Physician review, but subtle errors may be missed | 1.0× |
| Final Score | 15 × 1.0 = 15 → HIGH RISK | |
Required Actions: Mandatory HITL with physician final decision, comprehensive clinical validation, EU AI Act high-risk compliance, FDA/CE medical device requirements, robust uncertainty quantification.
🧮 Interactive Risk Calculator
Use this calculator to quickly assess AI system risks:
🔗 Related Framework Components
- Section 3.1: AI Risk Tiering System - Classification framework
- Section 3.2: Impact Assessment Methodology - Assessment process
- Appendix A: Algorithmic Impact Assessment - Full assessment template
- Section 2.3: Escalation Pathways - When to escalate risks