Model corrections

Explainability Requirements Met

Explainability requirements satisfied at each stage of development and deployment

Pre-Development

Preliminary Requirements Set

What types of explainability are needed?

Who needs explanations?

Who's responsible for providing explanations and implementing explainability techniques?

Existing systems to use as a guide?

Hazards Identified

Risks of lack of explainability?

What types of explainability are needed to mitigate hazards?

Evidence to support sufficiency of explainability

Evaluation of Explainability Techniques Planned

e.g. human tests

During Development

Model Learning

Model Selection

Could a more explainable model be used?

Accuracy/explainability tradeoffs

What types of explanation are possible for potential models?

Do they meet requirements?

Explainability techniques planned

Techniques appropriate for model

Techniques provide appropriate types of explanation

Model Verification

Errors identified and appropriately explained/corrected

Explanations used to provide insights into operational domain

Explanations used to justify results provided by model

Deployment

Acceptable safety is assured in case of misuse and unavoidable accidents

Potential harm from misuse is mitigated

Explainability allows insight to foresee and limit potential for misuse

Potential harm from accidents is mitigated

Explainability allows contestability to challenge harmful model decisions

Global explainability provides predictability so accidents can be anticipated

Potential harm from adversarial attacks is mitigated

Users trust model

Users satisfied with explanations

Legal requirements met

Requirements

User Needs

Fairness

Trust

Answer causal/contrastive questions

Legal Needs

How do certain input factors affect outputs?

Users have a "right to explanation"

Explain mistakes/accidents

Technical Needs

Explainability

Local

Can individual decisions be explained?

Analyse why a particular problem occurred

Improve/correct model
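
As an illustration of the local items above, the following is a minimal sketch of explaining one individual decision. It assumes a simple linear scoring model, which the source does not specify; the names `predict`, `local_contributions`, the weights, and the baseline input are all hypothetical. For a linear model, each feature's contribution relative to a baseline is exact and the contributions sum to the change in output, which is what makes it possible to analyse why a particular result occurred.

```python
def predict(weights, bias, x):
    """Output of a hypothetical linear model: bias + sum(w_i * x_i)."""
    return bias + sum(w * xi for w, xi in zip(weights, x))


def local_contributions(weights, x, baseline):
    """Per-feature contribution to the difference f(x) - f(baseline).

    For a linear model this decomposition is exact: w_i * (x_i - baseline_i).
    """
    return [w * (xi - bi) for w, xi, bi in zip(weights, x, baseline)]


# Illustrative values only (assumptions, not taken from the source).
weights = [2.0, -1.0, 0.5]
bias = 1.0
baseline = [0.0, 0.0, 0.0]   # reference input the decision is compared against
x = [1.0, 3.0, 4.0]          # the individual decision being explained

contribs = local_contributions(weights, x, baseline)
output_change = predict(weights, bias, x) - predict(weights, bias, baseline)

for i, c in enumerate(contribs):
    print(f"feature {i}: contribution {c:+.2f}")
print(f"total change in output: {output_change:+.2f}")
```

For non-linear models the decomposition is no longer exact, so an attribution technique suited to the chosen model would be needed instead.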

Global

Generate explanations about the whole logic of the model

Inform design

Make predictions about future outputs
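
One possible way to obtain a global view of a model's logic is permutation importance: shuffle one input column at a time and measure how much the error grows. The sketch below is an assumption-laden illustration rather than a method named by the source; `toy_model`, the data, and the helper names are hypothetical, and mean squared error is an arbitrary choice of metric.

```python
import random


def mean_squared_error(model, rows, targets):
    """Average squared difference between model outputs and targets."""
    return sum((model(r) - t) ** 2 for r, t in zip(rows, targets)) / len(rows)


def permutation_importance(model, rows, targets, n_features, seed=0):
    """Error increase observed when each feature column is shuffled in turn."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    base_error = mean_squared_error(model, rows, targets)
    importances = []
    for j in range(n_features):
        column = [r[j] for r in rows]
        rng.shuffle(column)
        # Rebuild the rows with only column j replaced by its shuffled values.
        permuted = [r[:j] + [v] + r[j + 1:] for r, v in zip(rows, column)]
        importances.append(mean_squared_error(model, permuted, targets) - base_error)
    return importances


def toy_model(row):
    """Hypothetical model that depends on feature 0 and ignores feature 1."""
    return 2.0 * row[0]


rows = [[float(i), float(i % 3)] for i in range(8)]
targets = [toy_model(r) for r in rows]
importances = permutation_importance(toy_model, rows, targets, n_features=2)
print(importances)
```

A feature the model ignores receives an importance of zero, while features it relies on show an error increase, which can inform design decisions about which inputs matter.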

Robustness

Transparency

Relate inputs to outputs

Relate parameters to human-understandable concepts

Justification

Predictability

Do explanations allow us to predict model behavior?
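
One hedged way to make the question above measurable is to check how often a simple, human-readable rule agrees with the model on a set of inputs (often called fidelity). The sketch below assumes a hypothetical black-box classifier and a hypothetical one-feature rule standing in for the explanation; neither comes from the source.

```python
def black_box(x):
    """Hypothetical opaque classifier (a stand-in for the real model)."""
    return 1 if 0.7 * x[0] + 0.3 * x[1] > 0.5 else 0


def surrogate_rule(x):
    """Hypothetical explanation: 'the model says 1 when feature 0 exceeds 0.5'."""
    return 1 if x[0] > 0.5 else 0


def fidelity(model, explanation, inputs):
    """Fraction of inputs on which the explanation predicts the model's output."""
    agreements = sum(1 for x in inputs if model(x) == explanation(x))
    return agreements / len(inputs)


# Illustrative inputs only; the last one is a case where the rule is wrong.
inputs = [(0.9, 0.9), (0.9, 0.1), (0.1, 0.9), (0.1, 0.1), (0.6, 0.0)]
score = fidelity(black_box, surrogate_rule, inputs)
print(f"fidelity: {score:.2f}")
```

A low score suggests the explanation does not support reliable predictions of model behavior; what counts as sufficient would depend on the requirements set earlier.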

Contestability