AI Explainability

1. Purpose

This document defines the explainability, safety, and governance framework for all AI systems developed and deployed by the organisation. It establishes the principles and requirements for ensuring that AI-driven outputs are transparent, understandable, and appropriately governed throughout their lifecycle.

The purpose of this framework is to:

Ensure transparency in AI-assisted decisions across clinical, operational, and administrative use cases, enabling appropriate human oversight and interpretation.
Support compliance with applicable legal, regulatory, and healthcare standards, including UK GDPR and relevant NHS clinical safety and procurement requirements.
Build and maintain trust with healthcare providers, patients, regulators, and auditors through clear, consistent, and auditable explanations of AI behaviour and outputs.
Reduce organisational and patient risk by identifying, documenting, and mitigating issues such as bias, unsafe recommendations, and opaque or non-interpretable automated decision-making.

2. Scope

This policy applies to all artificial intelligence (AI) and machine learning (ML) systems that are developed, deployed, integrated, or embedded within the organisation's SaaS platform. It covers any system that produces automated or semi-automated outputs that may influence clinical, operational, or administrative decisions.

2.1. In scope systems include

Predictive models, classification systems, recommendation engines, and optimisation algorithms
Retrieval-Augmented Generation (RAG) systems
Large Language Model (LLM)-based features
Any current or future AI or hybrid systems used within the platform
Any updates, retraining, tuning, or configuration changes that affect system behaviour or outputs
The full system lifecycle, including design, development, testing, deployment, monitoring, maintenance, and retirement

2.2. In scope stakeholders include

Internal personnel, including employees and contractors involved in AI development, deployment, or support
External suppliers, vendors, and delivery partners contributing to AI systems under contract
Healthcare customers, including hospitals, clinics, and care providers using the platform
End users and affected individuals, including patients where AI outputs may influence care or decisions
Relevant regulators and oversight bodies, including the Information Commissioner's Office (ICO), the Medicines and Healthcare products Regulatory Agency (MHRA), and other applicable authorities

2.3. Out of scope

Purely manual processes that do not involve AI, ML, or automated decision-making
Third-party systems that are not integrated into, controlled by, or operationally dependent on the SaaS platform, unless contractually required or directly contributing to AI-generated outputs within the platform

3. Security and Data Minimisation

3.1 Data Access Controls

All system access is restricted to authenticated users via secure access mechanisms. Role-based access control (RBAC) is enforced across the platform to ensure users can only access data and perform actions aligned with their assigned permissions.

All integrations, data connections, and external system interfaces must be reviewed and approved by the IT or security function prior to activation.

The AI system does not expand, infer, or expose access to data beyond what the user is already authorised to view in the underlying source systems. It operates strictly within existing permission boundaries.

3.2 Data Minimisation

The system processes only the minimum amount of data necessary to fulfil a user's request or query.

Data retrieval is scoped to explicitly permitted data sources and user-authorised records. No additional, unrelated, or excessive personal data is accessed or processed.

3.3 Data Synchronisation and Deletion

Data within the system is regularly synchronised with source systems to ensure accuracy and consistency.

Where data is updated or deleted in the source system, these changes are reflected in the AI system in line with applicable data retention and deletion policies. This ensures compliance with UK GDPR principles of accuracy and storage limitation.

3.4 Encryption and Infrastructure Security

All data is encrypted in transit and at rest using industry-standard encryption protocols.

Sensitive information, including credentials and authentication secrets, is securely stored using appropriate encryption and secret management practices.

The system is hosted within a secured, firewall-protected environment with strict access controls and monitored infrastructure security measures.

4. Fairness, Bias, and Discrimination Mitigation

4.1 Dataset Design and Testing

The system is built and tested using a combination of synthetic datasets and real-world datasets containing thousands of representative examples.

Test scenarios are designed in collaboration with care professionals to ensure they reflect real clinical and operational contexts, and to support fairness, safety, and domain relevance.

4.2 Bias Mitigation Approach

The system is designed to produce outputs grounded in source data through a context engineering approach that ensures responses remain anchored to relevant and verifiable information.

Outputs are generated using retrieved or referenced data and are not intended to express subjective opinion or unsupported assertions. To reduce the risk of hallucination, distortion, and bias, responses are consistently linked to underlying data sources and citations where applicable.

Fairness, bias, and representativeness are assessed through defined evaluation methodologies, including subgroup performance testing across protected characteristics under the Equality Act 2010, and monitoring for disparities in output quality and accuracy. These evaluations are conducted as part of the system's ongoing validation and governance process, with results reviewed at release and during periodic audits.

4.3 Human-Centred Safeguards

Care professionals are involved throughout the system lifecycle, including the design of test cases and the review of system outputs.

Where conflicting or inconsistent information is detected, the system clearly highlights discrepancies and presents relevant source data transparently.

The system is designed to support decision-making, not replace it. Final interpretation and decisions remain with qualified human users.

5. Accuracy and Performance Monitoring

5.1 Testing Framework

The system is validated through a structured testing approach, including unit tests and scenario-based evaluations using real-world care and operational examples.

All test scenarios are defined and reviewed in collaboration with care professionals to ensure clinical relevance, safety, and practical applicability.

5.2 Release Criteria

All changes, updates, or new features must pass at least 90% of the defined test scenarios before being released into production.

No deployment is permitted unless all validation and evaluation tests have been fully completed and formally approved.

This requirement is enforced by the Engineering Team under the oversight of the CTO, who holds ultimate responsibility for release readiness. Where evaluations fail, the AI Engineering Team is required to remediate and update the codebase until all tests are successfully passed.

5.3 Continuous Improvement

Any identified inaccuracies, defects, or unexpected behaviours are formally logged, tracked, and prioritised for resolution.

Relevant cases are added to the ongoing evaluation and regression test suite to prevent recurrence.

All previously identified issues must pass regression testing before subsequent releases are approved.

5.4 Monitoring and Logging

The system is continuously monitored to ensure performance, reliability, and issue detection.

Monitoring and incident management are supported through:

Grafana for system performance monitoring and observability
Jira Service Management for incident tracking, investigation, and resolution
Email-based escalation channels managed by account and support teams

All incidents are reviewed, categorised, and used to inform ongoing system improvements.

6. Individual Rights and Human Oversight

6.1 Transparency and Explainability

All AI-generated outputs are produced with the underlying retrieval context attached, enabling transparency and explainability. This includes visibility of the source data used to generate the response, ensuring outputs can be traced back to their originating context and supporting information.

6.2 Human-in-the-Loop Controls

Human oversight is embedded within the system, particularly for high-risk or sensitive use cases.

Users are able to:

Review AI-generated outputs
Access and validate the supporting evidence
Override, ignore, or challenge responses where appropriate

The system is designed to support, not replace, human judgement.

6.3 Support for Data Subject Rights

The system is designed with capabilities intended to support the exercise of key data subject rights under UK GDPR, including access, rectification, erasure, and restriction of processing, subject to applicable legal, technical, and organisational constraints.

These controls ensure that AI functionality remains aligned with data protection obligations.

7. AI System Architecture and Decision-Making Process

7.1 Overview

The system is built using a Retrieval-Augmented Generation (RAG) architecture. This approach combines structured data retrieval with controlled AI generation to ensure outputs are accurate, traceable, and grounded in source data.

7.2 Data Processing and Storage

When data is ingested into the system, it is broken down into smaller, structured chunks to enable efficient retrieval.

Data is stored in a format optimised for search and relevance ranking, while preserving the original meaning, context, and structure of the source material.

7.3 Query Processing

When a user submits a query, the system analyses and, where necessary, decomposes it into smaller sub-queries.

Only data sources that the user is authorised to access are searched. Retrieved results are then ranked, filtered, and prioritised based on relevance to the query.

7.4 Response Generation

Responses are generated using a constrained, low-temperature AI model designed to prioritise accuracy over fluency.

The system:

Generates outputs intended to be grounded in retrieved data, constrained by the RAG pipeline and validation controls.
Is designed to minimise introduction of external or fabricated information by restricting generation to retrieved and validated context.
Is designed to reduce speculation and unsupported claims by limiting outputs to available and validated source context.

All responses are grounded in retrieved content and include clear citations to the underlying data.

7.5 Explainability by Design

Explainability is built into the system by default. All AI-generated outputs are designed to be transparent and traceable, with responses including citations or references to the underlying source data used to generate the answer. This ensures users can verify outputs against their originating context and supporting evidence.

All responses are:

Traceable back to their original source documents
Verifiable by the user through accessible references

The system prioritises deterministic, evidence-based outputs over open-ended or generative responses, ensuring consistency, auditability, and user trust.

8. Guidance

8.1 Appropriate Use and Clinical Responsibility

Clinical decision support only: The AI system is intended solely as a support tool for information retrieval, summarisation, and administrative assistance. It must not be used to autonomously make clinical decisions, determine diagnoses, or prescribe treatments.
Professional accountability: Clinical responsibility remains fully with the qualified healthcare professional. AI outputs must be treated as advisory and not authoritative.
Managing automation bias: Users must remain vigilant against over-reliance on AI outputs and should critically assess all responses, particularly in high-risk or ambiguous scenarios.

8.2 Mandatory Verification and Citation Auditing

Source verification required: Users must validate AI-generated outputs by reviewing the cited source data. Summaries must not be relied upon without checking the underlying records or documentation.
Review before finalisation: For any clinical documentation (e.g., notes, letters, transcriptions), users must fully review, edit, and confirm accuracy before use or sharing.
Error identification and reporting: Users must remain alert to potential issues such as omissions, misinterpretations, or unsupported content. Any discrepancies must be corrected in the workflow and formally reported through designated support channels.

8.3 Information Governance and Safe Use

No speculative prompting: Users must not instruct the system to speculate, guess, or generate clinical inferences beyond the available source data.
Data boundary compliance: All use must remain within the secure, authenticated platform. Patient-identifiable information (PII/PHI) must not be entered into external, unapproved, or public AI tools.
Secure handling of data: Users must follow all organisational information governance and data protection policies when interacting with the system.

9. Summary

The AI system is designed with:

Strong data protection and data minimisation controls
Built-in bias mitigation and fairness safeguards
Rigorous accuracy testing and continuous performance monitoring
Clear human oversight and explainability mechanisms

These measures ensure alignment with key UK GDPR principles, including:

Lawfulness, fairness, and transparency
Accuracy
Data minimisation
Integrity and confidentiality
Accountability